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PHYTYUPRENYLTRANSFERASE NUCLEIC ACIDS. POLYPEPTIDES 

AND USES THEREOF 

5 

TECHNICAL FIELD 

The present invention relates generally to plant molecular biology. More 
specifically, it relates to nucleic acids and methods for modulating their expression 
10 in plants. 

BACKGROUND OF THE INVENTION 

The chloroplasts of higher plants contain and elaborate many unique, 
interconnected biochemical pathways that produce an array of compounds that 

15 not only perform vita! plastid functions but are also important from agricultural and 
nutritional perspectives. One class of lipid soluble, chloroplastically synthesized 
compounds are the prenyllipids, plastoquinone and tocopherols. Plastoquinone is 
a fundamentally important component of the chloroplast photosynthetic electron 
transport chain and accounts for up to 50% of the total prenyllipid pool in green 

20 tissues. Tocopherols collectively account for up to 40% of the total prenyllipids 
pool in green plastids and have a well documented role in mammals as an 
antioxidant [Liebler, 1993] and a similar, though less well understood antioxidant 
role in plants [Hess, 1993]. The essential nutritional value of tocopherols has been 
known for over 70 years [Mason, 1980]. Despite the well studied, wide-spread 

25 importance of these chloroplastic compounds to human nutrition/ agriculture and 
biochemical processes within plant cells, much remains to be learned at the 
molecular level about their biosynthesis. 

Plastoquinone and tocopherols are the most abundant prenyllipids in the 
plastid and are synthesized by the common pathway reviewed in Hess, 1993 and 

30 Soil, 1987. The head group for both compounds, homogentisic acid, is produced 
from p-hydroxyphenylpyruvic acid by the enzyme p-hydroxyphenylpyruvic acid 
dioxygenase in a reaction that catalyzes both an oxidation and decarboxylation. 
Homogentisic acid is subject to phytylation/prenylation (phytyl and solanyl, C20 
and C45, respectively) coupled to a simultaneous decarboxylation to form the first 
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true tocopherol and plastoquinone intermediates, 2-demethyl-phytylplastoquinol 
and 2-demethylplastoquinol-9, respectively. A single ring methylation occurs on 2- 
demethylplastoquinol to yield plastoquinol-9 that is then oxidized to plastoquinone- 
9. The preferred route in spinach for a-tocopherol formation is thought to be 1) 
ring methylation of 2-demethylphytylplastoquinol, to yield phytylplastoquinol, 2) 
cyclization to yield gamma-tocopherol and, finally, 3) a second ring methylation to 
yield a-tocopherol. The first ring methylation in both tocopherol and plastoquinone 
synthesis is thought to be carried out by a single enzyme that is specific for the 
sight of methylation on the ring but has broad substrate specificity and 
accommodates both classes of compounds. The final methylation enzyme 
(gamma tocopherol methyl transferase) is the only enzyme of the pathway that 
has been purified from plants to date (dHarlingue and Camara, 1985). All other 
enzymatic activities of tocopherol/plastoquinone synthesis have been localized to 
the inner chloroplast envelope by fractionation studies except p- 
hydroxyphenylpyruvic acid dioxygenase and the tocopherol cyclase enzyme. 
Difficulties with cell fractionation methods, low activities for some of the enzymes, 
substrate stability and availability and assay problems make studying the pathway 
biochemically extremely challenging. 

The fact that tocopherol and plastoquinone levels, ratios and total amounts 
vary by orders of magnitude in different plant tissues and developmental stages 
indicates the pathway is both highly regulated and highly flexible and has potential 
for quantitative and qualitative manipulation. However, while biochemical analysis 
has been useful in deciphering the biosynthetic pathway such studies have 
provided almost no insight into how bulk carbon flow through the pathway is 
regulated or how differing amounts of tocopherols or plastoquinone are 
synthesized. 



SUMMARY OF THE INVENTION 

It is an object of the present invention to provide nucleic acids and 
polypeptides relating to the biosynthesis of tocopherol and plastiquinone. 

It is another object of the present invention to provide nucleic acids and 
polypeptides that can be used to identify proteins involved in tocopherol and 
plastiquinone biosynthesis. 
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l| is another object of the present invention to provide antigenic fragments 
of the polypeptides of the present invention. 

It is another object of the present invention to provide transgenic plants 
comprising the nucleic acids of the present invention. 

It is another object of the present invention to provide methods for 
modulating, in a transgenic plant, the expression of the nucleic acids of the 
present invention. 

It is another object of the present invention to provide a method for 
modulating the level of tocopherols and plastiquinone in a plant. 

Other aspects of the present invention include expression cassettes 
comprising the nucleic acid operably linked to a promoter, host cells transfected 
with the expression cassette, and transgenic plants and seeds comprising the 
expression cassette. 

In a further aspect, the present invention relates to a method of modulating 
expression of the nucleic acids in a plant, comprising the steps of 

(a) transforming a plant cell with an expression cassette comprising a 
nucleic acid of the present invention operably linked to a promoter in 
sense or antisense orientation; 

(b) growing the plant cell under plant growing conditions to produce a 
regenerated plant capable of expressing the nucleic acid for a time 
sufficient to modulate expression of the nucleic acids in the plant 
compared to a corresponding non-transformed plant. 

Expression of the nucleic acids encoding the proteins of the present 
invention can be increased or decreased relative to a non-transformed control 
plant. 

DETAILED DESCRIPTION OF THE INVENTION 

Tocopherols are synthesized in the inner plastid membrane. The first 
committed step in the pathway is the condensation of the homogentisate head 
group with the phytyl tail catalyzed by an integral membrane protein: 
homogentisate: phytyl transferase. The present polypeptides catalyze the 
condensation of homogentisic acid with phytyldiphosphate or geranylgeranyl 
pyrophosphate to produce the first intermediates in tocopherol or tocotrienol 
synthesis, respectively. 
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The phytylation/prenylation of homogentisic acid is a likely key regulatory 
step for tail" synthesis and in determining the relative amounts of tocopherols, 
tocotrienols and plastoquinone produced as it is the branchpoint for the tocopherol 
and plastoquinone arms of the pathway. 
5 One purpose of this invention is to modulate a prenyllipid biosynthetic 

pathway, such as the plastoquinone and tocopherol pathways. The modulation of 
the pathway may be an up regulation or down regulation of the amount or activity 
of a prenyllipid (ie. plastoquinone or tocopherol), or of an intermediate in a 
pathway (ie. 2-demethyl-phytylplastoquinol or 2-demethylplastoquinol-9). 

10 

DEFINITIONS 

The term "isolated" refers to material, such as a nucleic acid or a protein, 
which is: (1) substantially or essentially free from components which normally 
accompany or interact with the material as found in its naturally occurring 

15 environment or (2) if the material is in its natural environment, the material has 
been altered by deliberate human intervention to a composition and/or placed at a 
locus in the cell other than the locus native to the material. 

The terms polypeptide, "peptide" and "protein" are used interchangeably 
herein to refer to a polymer of amino acid residues. The terms apply to amino acid 

20 polymers in which one or more amino acid residue is an artificial chemical 

analogue of a corresponding naturally occurring amino acid, as well as to naturally 
occurring amino acid polymers. The essential nature of such analogues of 
naturally occurring amino acids is that, when incorporated into a protein, that 
protein is specifically reactive to antibodies elicited to the same protein but 

25 consisting entirely of naturally occurring amino acids. The terms "polypeptide", 
"peptide" and "protein" are also inclusive of modifications including, but not limited 
to, glycosylate, lipid attachment, sulfation, gamma-carboxylation of glutamic acid 
residues, hydroxylation and ADP-ribosylation. Further, this invention contemplates 
the use of both the methionine-containing and the methionine-less amino terminal 

30 variants of the protein of the invention. 

As used herein, "plant" includes but is not limited to plant cells, plant tissue 
and plant seeds. 
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As used herein, "promoter" includes reference to a region of DNA upstream 
from the start of transcription and involved in recognition and binding of RNA 
polymerase and other proteins to initiate transcription. 

By "fragment" is intended a portion of the nucleotide sequence or a portion 
5 of the amino acid sequence and hence protein encoded thereby. Preferably 
fragments of a nucleotide sequence may encode protein fragments that retain the 
biological activity of the native nucleic acid. However, fragments of a nucleotide 
sequence which are useful as hybridization probes generally do not encode 
fragment proteins retaining biological activity. Fragments of a nucleotide 

10 sequence are generally greater than 10 nucleotides, preferably at least 20 
nucleotides and up to the entire nucleotide sequence encoding the proteins of the 
invention. Generally probes are less than 1000 nucleotides and preferably less 
than 500 nucleotides. Fragments of the invention include antisense sequences 
used to decrease expression of the inventive nucleic acids. Such antisense 

15 fragments may vary in length ranging from at least about 20 nucleotides, about 50 
nucleotides, about 100 nucleotides, up to and including the entire coding 
sequence. 

By "variants" is intended substantially similar sequences. Generally, 
nucleic acid sequence variants of the invention will have at least 50%, 60%, 70%, 

20 or preferably 80%, more preferably at least 90% and most preferably at least 95% 
sequence identity to the native nucleotide sequence. 

Generally, polypeptide sequence variants of the invention will have at least 
about 55%, 60%, 70%, 80%, or preferably at least about 90% and more preferably 
at least about 95% sequence identity to the native protein. 

25 As used herein, "sequence identity" or "identity" in the context of two nucleic 

acid or polypeptide sequences includes reference to the residues in the two 
sequences that are the same when aligned for maximum correspondence over a 
specified comparison window. When percentage of sequence identity is used in 
reference to proteins it is recognized that residue positions which are not identical 

30 often differ by conservative amino acid substitutions, where amino acid residues 
are substituted for other amino acid residues with similar chemical properties (e.g. 
charge or hydrophobicity) and therefore do not change the functional properties of 
the molecule. Where sequences differ for conservative substitutions, the percent 
identity may be adjusted upward to correct for the conservative nature of the 
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substitution. Means for making this adjustment are well known to those skilled in 
the art, and typically involve scoring a conservative substitution as a partial rather 
than a full mismatch. 

Methods of alignment of sequences for comparison are well-known in the 
5 art. Two methods are used herein to define the present invention. The first is the 
BLAST 2.0 suite of programs using default parameters. Altschul et a/., Nucleic 
Acids Res. 25:3389-3402 (1997). Software for performing BLAST analyses is 
publicly available, e.g., through the National Center for Biotechnology Information 
(http://www.ncbi.nlm.nih.gov/). The second is the GAP program, available as part 

10 of the Wisconsin Genetics Software Package, that uses the algorithm of 
Needleman and Wunsch (J. Mol. Bi ol. 48:443-453, 1970) to. find the alignment of 
two complete sequences that maximizes the number of matches and minimizes 
the number of gaps. Default gap creation penalty values and gap extension 
penalty values in Version 10 of the Wisconsin Genetics Software Package for 

15 nucleotide sequences are 50 and 3, respectively, and for protein sequences are 8 
and 2, respectively. Unless otherwise specified, references to the GAP program 
or algorithm refer to the GAP program or algorithm in version 10 of the Wisconsin 
Genetics Software Package. The gap creation and gap extension penalties can 
be expressed as an integer selected from the group of integers consisting of from 

20 0 to 200. Thus, for example/the gap creation and gap extension penalties can be 
0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65 or greater. 
The scoring matrix used in Version 10 of the Wisconsin Genetics Software 
Package is BLOSUM62 (see Henikoff & Henikoff (1989) Proc. NatL Acad. Sci. 
USA 89:10915). 

25 When GAP is used to compute % sequence identities for sequences of 

differing length, results determined by GAP may be reduced for non-overlapping 
nucleotides or amino acids in the longer sequence. For example, if a sequence of 
100 is compared to a sequence of 40, GAP may determine the percent identity to 
be 100% if the 40 nucleotides or amino acids of the shorter sequence match 40 

30 nucleotides or amino acids of the larger sequence. This is because GAP may 
calculate the percent identity based on the total length of the shorter sequence. 
However, where this specification, including the claims, specifies the sequence 
identity being computed by GAP, the GAP percentage identity should be re- 
calculated as a percentage of the longer sequence and any nucleotides or amino 
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acids in the larger sequence that extend beyond the shorter sequence would not 
count as a match. In the example provided above this would give a percent 
identity of 40%. 

Other methods of alignment of sequences for comparison are well-known in 

5 the art. Optimal alignment of sequences for comparison may be conducted by the 
local homology algorithm of Smith and Waterman, Adv. Appi Math. 2:482 (1981); 
by the homology alignment algorithm of Needleman and Wunsch, J. Mol. Biol. 
48:443 (1970); by the search for similarity method of Pearson and Lipman, Proc. 
Natl. Acad. ScL 85:2444 (1988); by computerized implementations of these 

10 algorithms, including, but not limited to: CLUSTAL in the PC/Gene program by 
Intelligenetics, Mountain View, California; GAP, BESTFIT, FASTA, BLAST and 
TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group 
(GCG), 575 Science Dr., Madison, Wisconsin, USA; the CLUSTAL program is well 
described by Higgins and Sharp, Gene 73:237-244 (1988); Higgins and Sharp, 

15 CABIOS 5:151-153 (1989); Corpet et al. t Nucleic Acids Research 16:10881-90 
(1988); Huang et a/., Computer Applications in the Biosciences 8:155-65 (1992), 
and Pearson et al., Methods in Molecular Biology 24:307-331 (1994). 

The BLAST family of programs which can be used for database similarity 
searches includes: BLASTN for nucleotide query sequences against nucleotide 

20 database sequences; BLASTX for nucleotide query sequences against protein 
database sequences; BLASTP for protein query sequences against protein 
database sequences; TBLASTN for protein query sequences against nucleotide 
database sequences; and TBLASTX for nucleotide query sequences against 
nucleotide database sequences. See Current Protocols in Molecular Biology, 

25 Chapter 19, Ausubel et a/., Eds., Greene Publishing and Wiley-lnterscience, New 
York (1995). Software for performing BLAST analyses is publicly available, e.g., 
through the National Center for Biotechnology Information 
(http://www.ncbi.nlm.nih.gov/). The BLAST algorithm performs a statistical 
analysis of the similarity between two sequences (see, e.g., Karlin & Altschul, 

30 Proc. Natl Acad. Sci. USA 90:5873-5877 (1993)). One measure of similarity 
provided by the BLAST algorithm is the smallest sum probability (P(N)) ( which 
provides an indication of the probability by which a match between two nucleotide 
or amino acid sequences would occur by chance. 
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The term "functional equivalent" means that the sequence of the variant 
polynucleotide defines a chain that produces a protein having substantially the 
same biological effect as the protein encoded by the non-variant polynucleotide. 

The term "Complement" or "Complementary" when used with respect to a 
polynucleotide sequence refers to the corresponding base pairs in the same 
sequence. 

The term "Hybridization Probe" refers to a process whereby a 
polynucleotide is used to find a complementary polynucloetide through the 
annealing of the two polynucleotides to form a double helix. 

The term "Coding Sequence" when used with respect to a complete gene 
sequence refers to the sequence spanning the start and stop codon, and when 
used with respect to a partial gene sequence refers to a portion of the coding 
region spanning the start and stop codon. 



NUCLEIC ACIDS 

The isolated nucleic acids of the present invention can be made using (a) 
standard recombinant methods, (b) synthetic techniques, or combinations thereof. 
In some embodiments, the polynucleotides of the present invention will be cloned, 
amplified, or otherwise constructed from a monocot or dicot In preferred 
embodiments the monocot is com, sorghum, barley, wheat, millet, or rice. 
Preferred dicots include soybeans, sunflower, canola, alfalfa, cotton, potato, 
cassava, Arabidopsis thaliana. tomato, Bmssica vegetables, peppers, potatoes, 
apples, spinach, or lettuce. 

Functional fragments included in the invention can be obtained using 
primers that selectively hybridize under stringent conditions. Primers are generally 
at least 12 bases in length and can be as high as 200 bases, but will generally be 
from 15 to 75, preferably from 15 to 50. Functional fragments can be identified 
using a variety of techniques such as restriction analysis, Southern analysis, 
primer extension analysis, and DNA sequence analysis. 

The present invention includes a plurality of polynucleotides that encode for 
the identical amino acid sequence. The degeneracy of the genetic code allows for 
such "silent variations" which can be used, for example, to selectively hybridize 
and detect allelic variants of polynucleotides of the present invention. Additionally, 
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the present invention includes isolated nucleic acids comprising allelic variants. 
The term "allele" as used herein refers to a related nucleic acid of the same gene. 

Variants of nucleic acids included in the invention can be obtained, for 
example, by oligonucleotide-directed mutagenesis, linker-scanning mutagenesis, 
5 mutagenesis using the polymerase chain reaction, and the like. See, for example, 
Ausubel, pages 8.0.3 - 8.5.9. Also, see generally, McPherson (ed.), DIRECTED 
MUTAGENESIS: A Practical approach, (IRL Press, 1991). Thus, the present 
invention also encompasses DNA molecules comprising nucleotide sequences 
that have substantial sequence identity with the inventive sequences. 

io Variants included in the invention may contain individual substitutions, 

deletions or additions to the nucleic acid or polypeptide sequences. Such 
changes will alter, add or delete a single amino acid or a small percentage of 
amino acids in the encoded sequence. Variants are referred to as "conservatively 
modified variants" where the alteration results in the substitution of an amino acid 

15 with a chemically similar amino acid. When the nucleic acid is prepared or altered 
synthetically, advantage can be taken of known codon preferences of the intended 
host. 

The present invention also includes "shufflents" produced by sequence 
shuffling of the inventive polynucleotides to obtain a desired characteristic. 

20 Sequence shuffling is described in PCT publication No. 96/19256. See also, 
Zhang, J. H., et a/., Proc. Natl. Acad. Sci. USA 94:4504-4509 (1997). 

The present invention also includes the use of 5' and/or 3* UTR regions for 
modulation of translation of heterologous coding sequences. Positive sequence 
motifs include translational initiation consensus sequences (Kozak, Nucleic Acids 

25 Res.15:8125 (1987)) and the 7-methylguanosine cap structure (Drummond et a/., 
Nucleic Acids Res. 13:7375 (1985)). Negative elements include stable 
intramolecular 5' UTR stem-loop structures (Muesing et a/. f Cell 48:691 (1987)) 
and AUG sequences or short open reading frames preceded by an appropriate 
AUG in the 5' UTR (Kozak, supra, Rao et aL, Mol. and Cell. Biol. 8:284 (1988)). 

30 Further, the polypeptide-encoding segments of the polynucleotides of the 

present invention can be modified to alter codon usage. Altered codon usage can 
be employed to alter translational efficiency and/or to optimize the coding 
sequence for expression in a desired host or to optimize the codon usage in a 
heterologous sequence for expression in maize. Codon usage in the coding 
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regions of the polynucleotides of the present invention can be analyzed 
statistically using commercially available software packages such as "Codon 
Preference" available from the University of Wisconsin Genetics Computer Group 
(see Devereaux et a/., Nucleic Acids Res. 12:387-395 (1984)) or MacVector 4.1 
(Eastman Kodak Co., New Haven, Conn.). 

For example, the inventive nucleic acids can be optimized for enhanced 
expression in organisms of interest. See, for example, EPA0359472; 
W091/16432; Perlak et a/., Proc. Natl. Acad. ScL USA 88:3324-3328 (1991); and 
Murray et a/.. Nucleic Acids Res. 77:477-498 (1989). In this manner, the genes 
can be synthesized utilizing species-preferred codons. See, for example, Murray 
et a/., Nucleic Acids Res. 77:477-498 (1989), the disclosure of which is 
incorporated herein by reference. 

The present invention provides subsequences comprising isolated nucleic 
acids containing at least 16 contiguous bases of the inventive sequences. For 
example the isolated nucleic acid includes those comprising at least 20, 25, 30, 
40, 50, 60, 75 or 100 or more contiguous nucleotides of the inventive sequences. 
Subsequences of the isolated nucleic acid can be used to modulate or detect gene 
expression by introducing into the subsequences compounds which bind, 
intercalate, cleave and/or crosslink to nucleic acids. 

The nucleic acids of the invention may conveniently comprise a multi- 
cloning site comprising one or more endonuclease restriction sites inserted into 
the nucleic acid to aid in isolation of the polynucleotide. Also, translatable 
sequences may be inserted to aid in the isolation of the translated polynucleotide 
of the present invention. For example, a hexa-histidine marker sequence provides 
a convenient means to purify the proteins of the present invention. 

A polynucleotide of the present invention can be attached to a vector, 
adapter, promoter, transit peptide or linker for cloning and/or expression of a 
polynucleotide of the present invention. Additional sequences may be added to 
such cloning and/or expression sequences to optimize their function in cloning 
and/or expression, to aid in isolation of the polynucleotide, or to improve the 
introduction of the polynucleotide into a cell. Use of cloning vectors, expression 
vectors, adapters, and linkers is well known and extensively described in the art. 
For a description of such nucleic acids see, for example, Stratagene Cloning 
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Systems, Catalogs 1995, 1996, 1997 (La Jolla, CA); and, Amersham Life 
Sciences, Inc. Catalog '97 (Arlington Heights, IL). 

The isolated nucleic acid compositions of this invention, such as RNA, 
cDNA, genomic DNA, or a hybrid thereof, can be obtained from plant biological 

5 sources using any number of cloning methodologies known to those of skill in the 
art. In some embodiments, oligonucleotide probes that selectively hybridize, 
under stringent conditions, to the polynucleotides of the present invention are used 
to identify the desired sequence in a cDNA or genomic DNA library. 

Exemplary total RNA and mRNA isolation protocols are described in Plant 

10 Molecular Biology: A Laboratory Manual, Clark, Ed. t Springer-Verlag, Berlin 
(1997); and, Current Protocols in Molecular Biology, Ausubel, et a/. t Eds., Greene 
Publishing and Wiley-lnterscience, New York (1995). Total RNA and mRNA 
isolation kits are commercially available from vendors such as Stratagene (La 
Jolla, CA), Clonetech (Palo Alto, CA), Pharmacia (Piscataway, NJ) ( and 5-3' 

15 (Paoli, PA). See also, U.S. Patent Nos. 5,614,391; and, 5,459,253. 

Typical cDNA synthesis protocols are well known to the skilled artisan and 
are described in such standard references as: Plant Molecular Biology: A 
Laboratory Manual, Clark, Ed., Springer-Verlag, Berlin (1997); and, Current 
Protocols in Molecular Biology, Ausubel, et ai, Eds., Greene Publishing and 

20 Wiley-lnterscience, New York (1995). cDNA synthesis kits are available from a 
variety of commercial vendors such as Stratagene or Pharmacia. 

An exemplary method of constructing a greater than 95% pure full-length 
cDNA library is described by Carninci et a/., Genomics 37:327-336 (1996). Other 
methods for producing full-length libraries are known in the art. See, e.g., Edery et 

25 a/., MoL Cell 6/o/.15(6):3363-3371 (1995); and, PCT Application WO 96/34981 . 

It is often convenient to normalize a cDNA library to create a library in which 
each clone is more equally represented. A number of approaches to normalize 
cDNA libraries are known in the art. Construction of normalized libraries is 
described in Ko, Nucl. Acids. Res. 18(19):5705-5711 (1990); Patanjali ef a/. f Proc. 

30 Natl. Acad. USA 88:1943-1947 (1991); U.S. Patents 5,482,685 and 5,637,685; 
and Soares etai, Proc. Natl. Acad. ScL USA 91:9228-9232 (1994). 

Subtracted cDNA libraries are another means to increase the proportion of 
less abundant cDNA species. See, Foofe et al. in, Plant Molecular Biology: A 
Laboratory Manual, Clark, Ed., Springer-Verlag, Berlin (1997); Kho and Zarbl, 
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Technique, 3(2):58-63 (1991); Sive and St. John, Nucl. Acids Res. 16(22):10937 
(1988); Current Protocols in Molecular Biology, Ausubel, et a/., Eds., Greene 
Publishing and Wiley-lnterscience, New York (1995); and, Swaroop et a/., Nucl. 
Acids Res. 19(8):1954 (1991). cDNA subtraction kits are commercially available. 
5 See, e.g., PCR-Select (Clontech). 

To construct genomic libraries, large segments of genomic DNA are 
generated by random fragmentation. Examples of appropriate molecular 
biological techniques and instructions are found in Sambrook, et a/.. Molecular 
Cloning: A Laboratory Manual. 2nd Ed., Cold Spring Harbor Laboratory Vols. 1-3 

10 (1989), Methods in Enzymology, Vol. 152: Guide to Molecular Cloning 
Techniques, Berger and Kimmel, Eds., San Diego: Academic Press, Inc. (1987). 
Current Protocols in Molecular Biology, Ausubel, et a/., Eds.. Greene Publishing 
and Wiley-lnterscience. New York (1995); Plant Molecular Biology: A Laboratory 
Manual, Clark. Ed., Springer-Verlag, Berlin (1997). Kits for construction of 

15 genomic libraries are also commercially available. 

The cDNA or genomic library can be screened using a probe based upon 
the sequence of a nucleic acid of the present invention such as those disclosed 
herein. Probes may be used to hybridize with genomic DNA or cDNA sequences 
to isolate homologous genes in the same or different plant species. Those of skill 

20 in the art will appreciate that various degrees of stringency of hybridization can be 
employed in the assay; and either the hybridization or the wash medium can be 
stringent. The degree of stringency can be controlled by temperature, ionic 
strength, pH and the presence of a partially denaturing solvent such as 
formamide. 

25 Typically, stringent hybridization conditions will be those in which the salt 

concentration is less than about 1.5 M Na ion, typically about 0.01 to 1.0 M Na ion 
concentration (or other salts) at pH 7.0 to 8.3 and the temperature is at least about 
30°C for short probes (e.g., 10 to 50 nucleotides) and at least about 60°C for long 
probes (e.g., greater than 50 nucleotides). Typically the hybridization will be 

30 conducted for about 4 to about 1 2 hours. 

Preferably the hybridization is conducted under low stringency conditions 
which include hybridization with a buffer solution of 30 % formamide, 1 M NaCI, 
1 % SDS (sodium dodecyl sulfate) at 37°C. and a wash in 1X to 2X SSC (20X SSC 
= 3.0 M NaCI/0.3 M trisodium citrate) at 50°C. More preferably the hybridization is 
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conducted under moderate stringency conditions which include hybridization in 40 
% formamide, 1 M NaCI, 1% SDS at 37°C, and a wash in 0.5X to 1X SSC at 55°C. 
Most preferably the hybridization is conducted under high stringency conditions 
which include hybridization in 50% formamide, 1 M NaCI, 1% SDS at 37°C, and a 

5 wash in 0.1X SSC at 60°C. 

An extensive guide to the hybridization of nucleic acids is found in Tijssen, 
Laboratory Techniques in Biochemistry and Molecular Biology-Hybridization with 
Nucleic Acid Probes, Part I, Chapter 2 "Overview of principles of hybridization and 
the strategy of nucleic acid probe assays", Elsevier, New York (1993); and Current 

10 Protocols in Molecular Biology, Chapter 2, Ausubel, et a/., Eds., Greene 
Publishing and Wiley-lnterscience, New York (1995). Often, cDNA libraries will be 
normalized to increase the representation of relatively rare cDNAs. 

The nucleic acids of the invention can be amplified from nucleic acid 
samples using amplification techniques. For instance, polymerase chain reaction 

15 (PCR) technology can be used to amplify the sequences of polynucleotides of the 
present invention and related genes directly from genomic DNA or cDNA libraries. 
PCR and other in vitro amplification methods may also be useful, for example, to 
clone nucleic acid sequences that code for proteins to be expressed, to make 
nucleic acids to use as probes for detecting the presence of the desired mRNA in 

20 samples, for nucleic acid sequencing, or for other purposes. 

Examples of techniques useful for in vitro amplification methods are found 
in Berger, Sambrook, and Ausubel, as well as Mullis et a/., U.S. Patent No. 
4,683,202 (1987); and/ PCR Protocols A Guide to Methods and Applications, Innis 
et a/., Eds., Academic Press Inc., San Diego, CA (1990). Commercially available 

25 kits for genomic PCR amplification are known in the art. See, e.g., Advantage-GC 
Genomic PCR Kit (Clontech). The T4 gene 32 protein (Boehringer Mannheim) 
can be used to improve yield of long PCR products. 

PCR-based screening methods have also been described. Wilfinger et ai 
describe a PCR-based method in which the longest cDNA is identified in the first 

30 step so that incomplete clones can be eliminated from study. BioTechniques, 
22(3): 481-486 (1997). 

In one aspect of the invention, nucleic acids can be amplified from a plant 
nucleic acid library. The nucleic acid library may be a cDNA library, a genomic 
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library, or a library generally constructed from nuclear transcripts at any stage of 
intron processing. Libraries can be made from a variety of plant tissues. 

Alternatively, the sequences of the invention can be used to isolate 
corresponding sequences in other organisms, particularly other plants, more 
particularly, other monocots. In this manner, methods such as PCR, hybridization, 
and the like can be used to identify such sequences having substantial sequence 
identity to the sequences of the invention. See, for example, Sambrook et al. 
(1989) Molecular Cloning: A Laboratory Manual (2d ed., Cold Spring Harbor 
Laboratory Press. Plainview, New York), and Innis et al. (1990), PCR Protocols: A 
Guide to Methods and Applications (Academic Press, New York). Coding 
sequences isolated based on their sequence identity to the entire inventive coding 
sequences set forth herein or to fragments thereof are encompassed by the 
present invention. 

The isolated nucleic acids of the present invention can also be prepared by 
direct chemical synthesis by methods such as the phosphotriester method of 
Narang et al., Meth. Enzymol. 68:90-99 (1979); the phosphodiester method of 
Brown et al., Meth. Enzymol. 68:109-151 (1979); the diethylphosphoramidite 
method of Beaucage et al., Terra. Lett. 22:1859-1862 (1981); the solid phase 
phosphoramidite Wester method described by Beaucage and Caruthers, Tetra. 
Letts. 22(20):1 859-1 862 (1981), e.g., using an automated synthesizer, e.g., as 
described in Needham-VanDevanter et al., Nucleic Acids Res., 12:6159-6168 
(1984); and, the solid support method of U.S. Patent No. 4,458,066. Chemical 
synthesis generally produces a single stranded oligonucleotide. This may be 
converted into double stranded DNA by hybridization with a complementary 
sequence, or by polymerization with a DNA polymerase using the single strand as 
a template. One of skill will recognize that while chemical synthesis of DNA is 
limited to sequences of about 100 bases, longer sequences may be obtained by 
the ligation of shorter sequences. 

EXPRESSION CASSETTES 
In another embodiment expression cassettes comprising isolated nucleic 
acids of the present invention are provided. An expression cassette will typically 
comprise a polynucleotide of the present invention operably linked to 
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transcriptional initiation regulatory sequences which will direct the transcription of 
the polynucleotide in the intended host cell, such as tissues of a transformed plant. 

The construction of expression cassettes that can be employed in 
conjunction with the present invention is well known to those of skill in the art in 
5 light of the present disclosure. See, e.g., Sambrook, et a/.; Molecular Cloning: A 
Laboratory Manual : Cold Spring Harbor, New York; (1989); Gelvin, ef a/.; Plant 
Molecular Biology Manual : (1990); Plant Biotechnology: Commercial Prospects 
and Problems , eds. Prakash, et a/.; Oxford & IBH Publishing Co.; New Delhi, 
India; (1993); and Heslot, et a/.; Molecular Biology and Genetic Engineering of 
10 Yeasts : CRC Press, Inc., USA; (1992); each incorporated herein in its entirety by 
reference. 

For example, plant expression vectors may include (1) a cloned plant 
nucleic acid under the transcriptional control of 5* and 3' regulatory sequences and 
(2) a dominant selectable marker. Such plant expression vectors may also 

15 contain, if desired, a promoter regulatory region (e.g., one conferring inducible, 
constitutive, environmentally- or developmentally-regulated, or cell- or tissue- 
specific/selective expression), a transcription initiation start site, a ribosome 
binding site, an RNA processing signal, a transcription termination site, and/or a 
polyadenylation signal. 

20 Constitutive, tissue-preferred or inducible promoters can be employed. 

Examples of constitutive promoters include the cauliflower mosaic virus (CaMV) 
35S transcription initiation region, the 1- or 2 - promoter derived from T-DNA of 
Agrobactehum tumefaciens, the ubiquitin 1 promoter, the Smas promoter, the 
cinnamyl alcohol dehydrogenase promoter (U.S. Patent No. 5,683,439), the Nos 

25 promoter, the pEmu promoter, the rubisco promoter, the GRP1-8 promoter and 
other transcription initiation regions from various plant genes known to those of 
skill. 

Examples of inducible promoters are the Adh1 promoter which is inducible 
by hypoxia or cold stress, the Hsp70 promoter which is inducible by heat stress, 
30 and the PPDK promoter which is inducible by light. Also useful are promoters 
which are chemically inducible. 

Examples of promoters under developmental control include promoters that 
initiate transcription preferentially in certain tissues, such as leaves, roots, fruit, 
seeds, or flowers. An exemplary promoter is the anther specific promoter 5126 
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(U.S. Patent Nos. 5,689,049 and 5,689,051). Examples of seed-preferred 
promoters include, but are not limited to, 27 kD gamma zein promoter and waxy 
promote, BoronatA, Martinez,M.C., Reina,M., Puigdomenech,P. and Pa!au,J.; 
Isolation and sequencing of a 28 kD glutelin-2 gene from maize: Common 
5 elements in the 5' flanking regions among zein and glutelin genes; Plant Sci. 47, 
95-102 (1986) and Reina.M., Ponte.l., Guillen.P., BoronatA and Palau.J., 
Sequence analysis of a genomic clone encoding a Zc2 protein from Zea mays 
W64 A, Nucleic Acids Res. 18 (21), 6426 (1990). See the following site relating to 
the waxy promoter Kloesgen.R.B., Gierl.A., Schwarz-Sommer f ZS. and 

io Saed!er,H., Molecular analysis of the waxy locus of Zea mays, Mol. Gen. Genet. 
203, 237-244 (1986). Promoters that express in the embryo, pericarp, and 
endosperm are disclosed in US applications Ser. Nos. 60/097,233 filed August 20, 
1998 and 60/098,230 filed August 28, 1998. The disclosures each of these are 
incorporated herein by reference in their entirety. 

15 Either heterologous or non-heterologous (i.e., endogenous) promoters can 

be employed to direct expression of the nucleic acids of the present invention. 
These promoters can also be used, for example, in expression cassettes to drive 
expression of antisense nucleic acids to reduce, increase, or alter concentration 
and/or composition of the proteins of the present invention in a desired tissue. 

20 If polypeptide expression is desired, it is generally desirable to include a 

polyadenylation region at the 3'-end of a polynucleotide coding region. The 
polyadenylation region can be derived from the natural gene, from a variety of 
other plant genes, or from T-DNA. The 3' end sequence to be added can be 
derived from, for example, the nopaline synthase or octopine synthase genes, or 

25 alternatively from another plant gene, or less preferably from any other eukaryotic 
gene. 

An intron sequence can be added to the 5' untranslated region or the 
coding sequence of the partial coding sequence to increase the amount of the 
mature message that accumulates. See for example Buchman and Berg, Mol 
30 Cell Biol. 8:4395-4405 (1988); Callis et a/., Genes Dev. 1:1 183-1200 (1987). Use 
of maize introns Adh1-S intron 1, 2, and 6, the Bronze-1 intron are known in the 
art. See generally, The Maize Handbook, Chapter 116, Freeling and Walbot, 
Eds., Springer, New York (1994). 
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The vector comprising the sequences from a polynucleotide of the present 
invention will typically comprise a marker gene which confers a selectable 
phenotype on plant cells. Usually, the selectable marker gene will encode 
antibiotic or herbicide resistance. Suitable genes include those coding for 
5 resistance to the antibiotic spectinomycin or streptomycin (e.g., the aada gene), 
the streptomycin phosphotransferase (SPT) gene coding for streptomycin 
resistance, the neomycin phosphotransferase (NPTII) gene encoding kanamycin 
or geneticin resistance, the hygromycin phosphotransferase (HPT) gene coding 
for hygromycin resistance. 

10 Suitable genes coding for resistance to herbicides include those which act 

to inhibit the action of acetolactate synthase (ALS), in particular the 
sulfonylurea-type herbicides (e.g., the acetolactate synthase (ALS) gene 
containing mutations leading to such resistance in particular the S4 and/or Hra 
mutations), those which act to inhibit action of glutamine synthase, such as 

15 phosphinothricin or basta (e.g., the bar gene), or other such genes known in the 
art. The bar gene encodes resistance to the herbicide basta and the ALS gene 
encodes resistance to the herbicide chlorsulfuron. 

Typical vectors useful for expression of nucleic acids in higher plants are 
well known in the art and include vectors derived from the tumor-inducing (Ti) 

20 plasmid of Agrobacterium tumefaciens described by Rogers et a/. f Meth. In 
Enzymol. 153:253-277 (1987). Exemplary A tumefaciens vectors useful herein 
are plasmids pKYLX6 and pKYLX7 of Schardl et a/., Gene 61:1-11 (1987) and 
Berger ef a/., Proc. Natl. Acad. Sci. USA 86:8402-8406 (1989). Another useful 
vector herein is plasmid pBI101.2 that is available from Clontech Laboratories, Inc. 

25 (Palo Alto, CA). 

A variety of plant viruses that can be employed as vectors are known in the 
art and include cauliflower mosaic virus (CaMV), geminivirus, brome mosaic virus, 
and tobacco mosaic virus. 

A polynucleotide of the present invention can be expressed in either sense 

30 or anti-sense orientation as desired. In plant cells, it has been shown that 
antisense RNA inhibits gene expression by preventing the accumulation of mRNA 
which encodes the enzyme of interest, see, e.g., Sheehy ef a/., Proc. Natl Acad. 
Sci USA 85: 8805-8809 (1988); and Hiatt et a/., U.S. Patent No. 4,801,340. 
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Another method of suppression is sense suppression. Introduction of 
nucleic acid configured in the sense orientation has been shown to be an effective 
means by which to block the transcription of target genes. For an example of the 
use of this method to modulate expression of endogenous genes see, Napoli et 
a/., The Plant Cell 2: 279^289 (1990) and U.S. Patent No. 5.034.323. 

A method of down-regulation of the protein involves using PEST sequences 
that provide a target for degradation of the protein. 

Catalytic RNA molecules or ribozymes can also be used to inhibit 
expression of plant genes. The inclusion of ribozyme sequences within antisense 
RNAs confers RNA-cleaving activity upon them, thereby increasing the activity of 
the constructs. The design and use of target RNA-specific ribozymes is described 
in Haseloff et a/., Nature 334:585-591 (1988). 

A variety of cross-linking agents, alkylating agents and radical generating 
species as pendant groups on polynucleotides of the present invention can be 
used to bind, label, detect, and/or cleave nucleic acids. For example, Vlassov, V. 
V.. et at., Nucleic Acids Res (1986) 14:4065-4076, describe covalent bonding of a 
single-stranded DNA fragment with alkylating derivatives of nucleotides 
complementary to target sequences. A report of similar work by the same group is 
that by Knorre. D. G. f et al, Biochimie (1985) 67:785-789. Iverson and Dervan 
also showed sequence-specific cleavage of single-stranded DNA mediated by 
incorporation of a modified nucleotide which was capable of activating cleavage 
(J. Am. Chem. Soc. (1987) 109:1241-1243). Meyer. R. B., et al., J. Am. Chem. 
Soc. (1989) 111:8517-8519, effect covalent crosslinking to a target nucleotide 
using an alkylating agent complementary to the single-stranded target nucleotide 
sequence. A photoactivated crosslinking to single-stranded oligonucleotides 
mediated by psoralen was disclosed by Lee, B. L, et al., Biochemistry (1988) 
27:3197-3203. Use of crosslinking in triple-helix forming probes was also 
disclosed by Home, et al., J. Am. Chem. Soc. (1990) 112:2435-2437. Use of N4, 
N4-ethanocytosine as an alkylating agent to crosslink to single-stranded 
oligonucleotides^ also been described by Webb and Matteucci, J. Am. Chem. 
Soc. (1986) 108:2764-2765; Nucleic Acids Res. (1986) 14:7661-7674; Feteritz et 
al., J. Am. Chem. Soc. 113:4000 (1991). Various compounds to bind, detect, 
label, and/or cleave nucleic acids are known in the art. See, for example, U.S. 
Patent Nos. 5.543,507; 5.672.593; 5.484.908; 5.256,648; and, 5,681941. 
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PROTEINS 

Proteins of the present invention include proteins derived from the native 
protein by deletion (so-called truncation), addition or substitution of one or more 

5 amino acids at one or more sites in the native protein. Such variants may result 
from, for example, genetic polymorphism or from human manipulation. Methods 
for such manipulations are generally known in the art. 

For example, amino acid sequence variants of the polypeptide can be 
prepared by mutations in the cloned DNA sequence encoding the native protein of 

10 interest. Methods for mutagenesis and nucleotide sequence alterations are well 
known in the art. See, for example, Walker and Gaastra, eds. (1983) Techniques 
in Molecular Biology (MacMillan Publishing Company, New York); Kunkel (1985) 
Proc. Natl. Acad. ScL USA 82:488-492; Kunkel et a/. (1987) Methods Enzymol. 
154:367-382; Sambrook et a/. (1989) Molecular Cloning: A Laboratory Manual 

is (Cold Spring Harbor, New York); U.S. Patent No. 4,873,192; and the references 
cited therein; herein incorporated by reference. Guidance as to appropriate amino 
acid substitutions that do not affect biological activity of the protein of interest may 
be found in the model of Dayhoff et a/. (1978) Atlas of Protein Sequence and 
Structure (Natl. Biomed. Res. Found., Washington, D.C.), herein incorporated by 

20 reference. Conservative substitutions, such as exchanging one amino acid with 
another having similar properties, may be preferred. 

In constructing variants of the proteins of interest, modifications to the 
nucleotide sequences encoding the variants will be made such that variants 
continue to possess the desired activity. Obviously, any mutations made in the 

25 DNA encoding the variant protein must not place the sequence out of reading 
frame and preferably will not create complementary regions that could produce 
secondary mRNA structure. See EP Patent Application Publication No. 75,444. 

The isolated proteins of the present invention include a polypeptide 
comprising at least 23 contiguous amino acids encoded by any one of the nucleic 

30 acids of the present invention, or polypeptides which are conservatively modified 
variants thereof. The proteins of the present invention or variants thereof can 
comprise any number of contiguous amino acid residues from a polypeptide of the 
present invention, wherein that number is selected from the group of integers 
consisting of from 23 to the number of residues in a full-length polypeptide of the 
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present invention. Optionally, this subsequence of contiguous amino acids is at 
least 25, 30, 35, or 40 amino acids in length, often at least 50, 60, 70. 80, or 90 
amino acids in length. > 

The present invention includes catalytically active polypeptides (i.e., 
enzymes). Catalytically active polypeptides will generally have a specific activity 
of at least 20%, 30%, or 40%, and preferably at least 50%, 60%, or 70%, and 
most preferably at least 80%, 90%. or 95% that of the native (non-synthetic), 
endogenous polypeptide. Further, the substrate specificity (kcat/M is optionally 
substantially similar to the native (non-synthetic), endogenous polypeptide. 
Typically, the K„ will be at least 30%, 40%, or 50%, that of the native (non- 
synthetic), endogenous polypeptide; and more preferably at least 60%, 70%, 80%, 
or 90%. Methods of assaying and quantifying measures of enzymatic activity and 
substrate specificity (kcat/K™), are well known to those of skill in the art. 

The present invention includes modifications that can be made to an 
inventive protein without diminishing its biological activity. Some modifications 
may be made to facilitate the cloning, expression, or incorporation of the targeting 
molecule into a fusion protein. Such modifications are well known to those of skill 
in the art and include, for example, a methionine added at the amino terminus to 
provide an initiation site, or additional amino acids (e.g., poly His) placed on either 
terminus to create conveniently located restriction sites or termination codons or 
purification sequences. 

A protein of the present invention can be expressed in a recombinantly 
engineered cell such as bacteria, yeast, insect, mammalian, or preferably plant 
cells. The cells produce the protein in a non-natural condition (e.g.. in quantity, 
composition, location, and/or time), because they have been genetically altered 
through human intervention to do so. 

Typically, an intermediate host cell will be used in the practice of this 
invention to increase the copy number of the cloning vector. With an increased 
copy number, the vector containing the nucleic acid of interest can be isolated in 
significant quantities for introduction into the desired plant cells. 

Host cells that can be used in the practice of this invention include 
prokaryotes, including bacterial hosts such as Eschericia coli. Salmonella 
typhimurium. and Serratia marcescens. Eukaryotic hosts such as yeast or 
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filamentous fungi may also be used in this invention. It preferred to use plant 
promoters that do not cause expression of the polypeptide in bacteria. 

Commonly used prokaryotic control sequences include promoters such as 
the beta lactamase (penicillinase) and lactose (lac) promoter systems (Chang et 
a/., Nature 198:1056 (1977)), the tryptophan (trp) promoter system (Goeddel et a/ M 
Nucleic Acids Res. 8:4057 (1980)) and the lambda derived P L promoter and N- 
gene ribosome binding site (Shimatake et a/., Nature 292:128 (1981)). The 
inclusion of selection markers in DNA vectors transfected in £. coli is also useful. 
Examples of such markers include genes specifying resistance to ampicillin, 
tetracycline, or chloramphenicol. 

The vector is selected to allow introduction into the appropriate host cell. 
Bacterial vectors are typically of plasmid or phage origin. Expression systems for 
expressing a protein of the present invention are available using Bacillus sp. and 
Salmonella (Palva, et a/. ( Gene 22:229-235 (1983); Mosbach, et a/., Nature 
302:543-545(1983)). 

Synthesis of heterologous proteins in yeast is well known. See Sherman, 
F. f et a/., Methods in Yeast Genetics, Cold Spring Harbor Laboratory (1982). Two 
widely utilized yeast for production of eukaryotic proteins are Sacchammyces 
cerevisiae and Pichia pastoris. Vectors, strains, and protocols for expression in 
Sacchammyces and Pichia are known in the art and available from commercial 
suppliers (e.g., Invitrogen). Suitable vectors usually have expression control 
sequences, such as promoters, including 3-phosphoglycerate kinase or alcohol 
oxidase, and an origin of replication, termination sequences and the like as 
desired. 

A protein of the present invention, once expressed, can be isolated from 
yeast by lysing the cells and applying standard protein isolation techniques to the 
lysates. The monitoring of the purification process can be accomplished by using 
Western blot techniques or radioimmunoassay of other standard immunoassay 
techniques. 

The proteins of the present invention can also be constructed using non- 
cellular synthetic methods. Solid phase synthesis of proteins of less than about 50 
amino acids in length may be accomplished by attaching the Oterminal amino 
acid of the sequence to an insoluble support followed by sequential addition of the 
remaining amino acids in the sequence. Techniques for solid phase synthesis are 
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described by Barany and Merrifield, Solid-Phase Peptide Synthesis, pp. 3-284 in 
The Peptides: Analysis, Synthesis, Biology Vol. 2: Special Methods in Peptide 
Synthesis, Part A; Merrifield, et a/., J. Am. Chem. Soc. 85:2149-2156 (1963), and 
Stewart et a/., Solid Phase Peptide Synthesis, 2nd ed. t Pierce Chem. Co., 
Rockford, III. (1984). Proteins of greater length may be synthesized by 
condensation of the amino and carboxy termini of shorter fragments. Methods of 
forming peptide bonds by activation of a carboxy terminal end (e.g., by the use of 
the coupling reagent N.N'-dicycylohexylcarbodiimide) is known to those of skill. 

The proteins of this invention may be purified to substantial purity by 
standard techniques well known in the art, including detergent solubilization, 
selective precipitation with such substances as ammonium sulfate, column 
chromatography, immunopurification methods, and others. See, for instance, R. 
Scopes, Protein Purification: Principles and Practice, Springer-Verlag: New York 
(1982); Deutscher, Guide to Protein Purification, Academic Press (1990). For 
example, antibodies may be raised to the proteins as described herein. 
Purification from £. coli can be achieved following procedures described in U.S. 
Patent No. 4,51 1 ,503. Detection of the expressed protein is achieved by methods 
known in the art and include, for example, radioimmunoassays, Western blotting 
techniques or immunoprecipitation. 

The present invention further provides a method for modulating (i.e., 
increasing or decreasing) the concentration or composition of the polypeptides of 
the present invention in a plant or part thereof. Modulation of the polypeptides can 
be effected by increasing or decreasing the concentration and/or the composition 
of the polypeptides in a plant. The method comprises transforming a plant cell 
with an expression cassette comprising a polynucleotide of the present invention 
to obtain a transformed plant cell, growing the transformed plant cell under plant 
forming conditions, and expressing the polynucleotide in the plant for a time 
sufficient to modulate concentration and/or composition of the polypeptides in the 
plant or plant part. 

In some embodiments, the content and/or composition of polypeptides of 
the present invention in a plant may be modulated by altering, in vivo or in vitro, 
the promoter of a non-isolated gene of the present invention to up- or down- 
regulate gene expression. In some embodiments, the coding regions of native 
genes of the present invention can be altered via substitution, addition, insertion, 
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or deletion to decrease activity of the encoded enzyme. See, e.g., Kmiec, U.S. 
Patent 5,565,350; Zarling et a/., PCT/US93/03868. 

In some embodiments, an isolated nucleic acid (e.g., a vector) comprising a 
promoter sequence is transfected into a plant cell. Subsequently, a plant cell 
comprising the isolated nucleic acid is selected for by means known to those of 
skill in the art such as, but not limited to, Southern blot, DNA sequencing, or PCR 
analysis using primers specific to the promoter and to the nucleic acid and 
detecting amplicons produced therefrom. A plant or plant part altered or modified 
by the foregoing embodiments is grown under plant forming conditions for a time 
sufficient to modulate the concentration and/or composition of polypeptides of the 
present invention in the plant. Plant forming conditions are well known in the art. 

In general, concentration of the polypeptides is increased or decreased by 
at least 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, or 90% relative to a 
native control plant, plant part, or cell lacking the aforementioned expression 
cassette. Modulation in the present invention may occur during and/or 
subsequent to growth of the plant to the desired stage of development. 

Modulating nucleic acid expression temporally and/or in particular tissues 
can be controlled by employing the appropriate promoter operably linked to a 
polynucleotide of the present invention in, for example, sense or antisense 
orientation as discussed in greater detail above. Induction of expression of a 
polynucleotide of the present invention can also be controlled by exogenous 
administration of an effective amount of inducing compound. Inducible promoters 
and inducing compounds that activate expression from these promoters are well 
known in the art. 

In preferred embodiments, the polypeptides of the present invention are 
modulated in monocots or dicots, preferably com, soybean, sunflower, sorghum, 
canola, wheat, alfalfa, cotton, rice, barley, millet, Arabidopsis thaliana, tomato, 
Brassica vegetables, peppers, potatoes, apples, spinach, or lettuce. 

Means of detecting the proteins of the present invention are not critical 
aspects of the present invention. In a preferred embodiment, the proteins are 
detected and/or quantified using any of a number of well recognized 
immunological binding assays (see, e.g., U.S. Patents 4,366,241; 4,376,110; 
4,517,288; and 4,837,168). For a review of the general immunoassays, see also 
Methods in Cell Biology, Vol. 37; Antibodies in Cell Biology, Asai, Ed., Academic 
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Press, Inc. New York (1993); Basic and Clinical Immunology 7th Edition, Stites & 
Terr, Eds. (1991). Moreover, the immunoassays of the present invention can be 
performed in any of several configurations, e.g., those reviewed in Enzyme 
Immunoassay, Maggio, Ed., CRC Press, Boca Raton, Florida (1980); Tijan, 
Practice and Theory of Enzyme Immunoassays, Laboratory Techniques in 
Biochemistry and Molecular Biology, Elsevier Science Publishers B.V., 
Amsterdam (1985); Harlow and Lane, supra; Immunoassay: A Practical Guide, 
Chan, Ed., Academic Press, Orlando, FL (1987); Principles and Practice of 
Immunoassays, Price and Newman Eds., Stockton Press, NY (1991); and A/on- 
isotopic Immunoassays, Ngo, Ed., Plenum Press, NY (1988). 

Typical methods for detecting proteins include Western blot (immunoblot) 
analysis, analytic biochemical methods such as electrophoresis, capillary 
electrophoresis, high performance liquid chromatography (HPLC), thin layer 
chromatography (TLC), hyperdiffusion chromatography, and the like, and various 
immunological methods such as fluid or gel precipitin reactions, immunodiffusion 
(single or double), Immunoelectrophoresis, radioimmunoassays (RIAs), enzyme- 
linked immunosorbent assays (ELISAs), immunofluorescent assays, and the like. 

Non-radioactive labels are often attached by indirect means. Generally, a 
ligand molecule (e.g., biotin) is covalently bound to the molecule. The ligand then 
binds to an anti-ligand (e.g., streptavidin) molecule that is either inherently 
detectable or covalently bound to a signal system, such as a detectable enzyme, a 
fluorescent compound, or a chemiluminescent compound. A number of ligands 
and anti-ligands can be used. Where a ligand has a natural anti-ligand, for 
example, biotin, thyroxine, and Cortisol, it can be used in conjunction with the 
labeled, naturally occurring anti-ligands. Alternatively, any haptenic or antigenic 
compound can be used in combination with an antibody. 

The molecules can also be conjugated directly to signal generating 
compounds, e.g., by conjugation with an enzyme or fiuorophore. Enzymes of 
interest as labels will primarily be hydrolases, particularly phosphatases, esterases 
and glycosidases, or oxidoreductases, particularly peroxidases. Fluorescent 
compounds include fluorescein and its derivatives, rhodamine and its derivatives, 
dansyl, umbelliferone, etc. Chemiluminescent compounds include luciferin, and 
2,3-dihydrophthalazinediones, e.g., luminol. For a review of various labeling or 
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signal producing systems which may be used, see, U.S. Patent No. 4,391,904, 
which is incorporated herein by reference. 

Some assay formats do not require the use of labeled components. For 
instance, agglutination assays can be used to detect the presence of the target 
antibodies. In this case, antigen-coated particles are agglutinated by samples 
comprising the target antibodies. In this format, none of the components need be 
labeled and the presence of the target antibody is detected by simple visual 
inspection. 

The proteins of the present invention can be used for identifying 
compounds that bind to (e.g., substrates), and/or increase or decrease (i.e., 
modulate) the enzymatic activity of, catalytically active polypeptides of the present 
invention. The method comprises contacting a polypeptide of the present 
invention with a compound whose ability to bind to or modulate enzyme activity is 
to be determined. The polypeptide employed will have at least 20%, preferably at 
least 30% or 40%, more preferably at least 50% or 60%, and most preferably at 
least 70% or 80% of the specific activity of the native, full-length polypeptide of the 
present invention (e.g., enzyme). Methods of measuring enzyme kinetics are well 
known in the art. See, e.g., Segel, Biochemical Calculations, 2 nd ed., John Wiley 
and Sons, New York (1976). 

Antibodies can be raised to a protein of the present invention, including 
individual, allelic, strain, or species variants, and fragments thereof, both in their 
naturally occurring (full-length) forms and in recombinant forms. Additionally, 
antibodies are raised to these proteins in either their native configurations or in 
non-native configurations. Anti-idiotypic antibodies can also be generated. Many 
methods of making antibodies are known to persons of skill. 

In some instances, it is desirable to prepare monoclonal antibodies from 
various mammalian hosts, such as mice, rodents, primates, humans, etc. 
Description of techniques for preparing such monoclonal antibodies are found in, 
e.g., 8as/c and Clinical Immunology, 4th ed., Stites et a/., Eds., Lange Medical 
Publications, Los Altos, CA, and references cited therein; Harlow and Lane, 
Supra\ Goding, Monoclonal Antibodies: Principles and Practice, 2nd ed., 
Academic Press, New York, NY (1986); and Kohler and Milstein, Nature 256: 495- 
497(1975). 



WO 00/68393 PCT/USOO/l 1439 

-26- 

Other suitable techniques involve selection of libraries of recombinant 
antibodies in phage or similar vectors (see, e.g., Huse et a/., Science 246:1275- 
1281 (1989); and Ward, et a/.. Nature 341:544-546 (1989); and Vaughan et a/. f 
Nature Biotechnology, 14:309-314 (1996)). Alternatively, high avidity human 
5 monoclonal antibodies can be obtained from transgenic mice comprising 
fragments of the unrearranged human heavy and light chain Ig loci (i.e., minilocus 
transgenic mice). Fishwild et a/. f Nature Biotech., 14:845-851 (1996). Also, 
recombinant immunoglobulins may be produced. See, Cabilly, U.S. Patent No. 
4,816,567; and Queen etaL, Proc. Natl Acad. Sci. 86:10029-10033 (1989). 

10 The antibodies of this invention can be used for affinity chromatography in 

isolating proteins of the present invention, for screening expression libraries for 
particular expression products such as normal or abnormal protein or for raising 
anti-id iotypic antibodies which are useful for detecting or diagnosing various 
pathological conditions related to the presence of the respective antigens. 

15 Frequently, the proteins and antibodies of the present invention will be 

labeled by joining, either covalently or non-covalently, a substance which provides 
for a detectable signal. A wide variety of labels and conjugation techniques are 
known and are reported extensively in both the scientific and patent literature. 
Suitable labels include radionucleotides, enzymes, substrates, cofactors, 

20 inhibitors, fluorescent moieties, chemiluminescent moieties, magnetic particles, 
and the like. 



Transformation of Cells 

The method of transformation/transfection is not critical to the invention; 

25 various methods of transformation or transfection are currently available. As 
newer methods are available to transform crops or other host cells they may be 
directly applied. Accordingly, a wide variety of methods have been developed to 
insert a DNA sequence into the genome of a host cell to obtain the transcription 
and/or translation of the sequence to effect phenotypic changes in the organism. 

30 Thus, any method that provides for efficient transformation/transfection may be 
employed. 

A DNA sequence coding for the desired polynucleotide of the present 
invention, for example a cDNA, RNA or a genomic sequence, will be used to 
construct an expression cassette that can be introduced into the desired plant. 
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Isolated nucleic acid acids of the present invention can be introduced into plants 
according techniques known in the art. Generally, expression cassettes as 
described above and suitable for transformation of plant cells are prepared. 

Techniques for transforming a wide variety of higher plant species are well 
known and described in the technical, scientific, and patent literature. See, for 
example, Weising et a/., Ann. Rev. Genet 22:421-477 (1988). For example, the 
DNA construct may be introduced directly into the genomic DNA of the plant cell 
using techniques such as electroporation, PEG-mediated transfection, particle 
bombardment, silicon fiber delivery, or microinjection of plant cell protoplasts or 
embryogenic callus. See, e.g., Tomes, et a/., Direct DNA Transfer into Intact Plant 
Cells Via Microprojectile Bombardment, pp. 197-2 13 in Plant Cell, Tissue and 
Organ Culture, Fundamental Methods, eds. O. L. Gamborg and G.C. Phillips. 
Springer-Verlag Berlin Heidelberg New York, 1995. Alternatively, the DNA 
constructs may be combined with suitable T-DNA flanking regions and introduced 
into a conventional Agrobacterium tumefaciens host vector. The virulence 
functions of the Agrobacterium tumefaciens host will direct the insertion of the 
construct and adjacent marker into the plant cell DNA when the cell is infected by 
the bacteria. See, U.S. Patent No. 5,591,616. 

The introduction of DNA constructs using polyethylene glycol precipitation 
is described in Paszkowski et ai, Embo J. 3:2717-2722 (1984). Electroporation 
techniques are described in Fromm et ai, Proc. Natl. Acad. Sci. 82:5824 (1985). 
Ballistic transformation techniques are described in Klein et al., Nature 327:70-73 
(1987). 

Agrobacterium fu/nefac/ens-meditated transformation techniques are well 
described in the scientific literature. See, for example Horsch et a/., Science 
233:496-498 (1984), and Fraley et a/., Proc. Natl. Acad. Sci. 80:4803 (1983). For 
instance, Agrobacterium transformation of maize is described in U.S. Patent No. 
5,550,318. 

Other methods of transfection or transformation include (1) Agrobacterium 
rhizogenes-mediated transformation (see, e.g., Lichtenstein and Fuller In: Genetic 
Engineering, vol. 6, PWJ Rigby, Ed., London, Academic Press, 1987; and 
Lichtenstein, C. P., and Draper, J,. In: DNA Cloning, Vol. II, D. M. Glover, Ed., 
Oxford, IRI Press, 1985), Application PCT/US87/02512 (WO 88/02405 published 
Apr. 7, 1988) describes the use of A. rhizogenes strain A4 and its Ri plasmid along 
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with A. tumefaciens vectors pARC8 or pARC16 (2) liposome-mediated DNA 
uptake (see, e.g.. Freeman et a/., Plant Cell Physiol. 25:1353, 1984), (3) the 
vortexing method (see, e.g., Kindle, Proa Natl. Acad. ScL USA 87:1228, (1990). 
DNA can also be introduced into plants by direct DNA transfer into pollen 
5 as described by Zhou et a/., Methods in Enzymology, 101:433 (1983); D. Hess, 
Intern Rev. Cytol., 107:367 (1987); Luo et a/., Plane Mol. Biol. Reporter, 6:165 
(1988). Expression of polypeptide coding nucleic acids can be obtained by 
injection of the DNA into reproductive organs of a plant as described by Pena et 
a/., Nature 325:274 (1987). DNA can also be injected directly into the cells of 

10 immature embryos and the rehydration of desiccated embryos as described by 
Neuhaus et a/., Theor. Appl. Genet., 75:30 (1987); and. Benbrook et a/., in 
Proceedings Bio Expo 1986, Butterworth, Stoneham, Mass., pp. 27-54 (1986). 

Animal and lower eukaryotic (e.g., yeast) host cells are competent or 
rendered competent for transfection by various means. There are several well- 

15 known methods of introducing DNA into animal cells. These include: calcium 
phosphate precipitation, fusion of the recipient cells with bacterial protoplasts 
containing the DNA, treatment of the recipient cells with liposomes containing the 
DNA, DEAE dextran, electroporation, biolistics, and micro-injection of the DNA 
directly into the cells. The transfected cells are cultured by means well known in 

20 the art. Kuchler, R.J., Biochemical Methods in Cell Culture and Virology, Dowden, 
Hutchinson and Ross, Inc. (1977). 

Transgenic Plant Regeneration 

Transformed plant cells which are derived by any of the above 
25 transformation techniques can be cultured to regenerate a whole plant which 
possesses the transformed genotype. Such regeneration techniques often rely on 
manipulation of certain phytohormones in a tissue culture growth medium, typically 
relying on a biocide and/or herbicide marker which has been introduced together 
with a polynucleotide of the present invention. For transformation and 
30 regeneration of maize see, Gordon-Kamm et al 9 The Plant Cell, 2:603-618 (1990). 

Plants cells transformed with a plant expression vector can be regenerated, 
e.g., from single cells, callus tissue or leaf discs according to standard plant tissue 
culture techniques. It is well known in the art that various cells, tissues, and 
organs from almost any plant can be successfully cultured to regenerate an entire 
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plant. Plant regeneration from cultured protoplasts is described in Evans et a/., 
Protoplasts Isolation and Culture, Handbook of Plant Cell Culture, Macmillan 
Publishing Company, New York, pp. 124-176 (1983); and Binding, Regeneration 
of Plants, Plant Protoplasts, CRC Press, Boca Raton, pp. 21-73 (1985). 

The regeneration of plants containing the foreign gene introduced by 
Agrobacterium can be achieved as described by Horsch et a/., Science, 227:1229- 
1231 (1985) and Fraley et ai, Proc. Natl. Acad. ScL USA 80:4803 (1983). This 
procedure typically produces shoots within two to four weeks and these 
transfonmant shoots are then transferred to an appropriate root-inducing medium 
containing the selective agent and an antibiotic to prevent bacterial growth. 
Transgenic plants of the present invention may be fertile or sterile. 

Regeneration can also be obtained from plant callus, explants, organs, or 
parts thereof. Such regeneration techniques are described generally in Klee et a/., 
Ann. Rev. of Plant Phys. 38:467-486 (1987). The regeneration of plants from 
either single plant protoplasts or various explants is well known in the art. See, for 
example, Methods for Plant Molecular Biology, A. Weissbach and H. Weissbach, 
eds., Academic Press, Inc., San Diego, Calif. (1988). For maize cell culture and 
regeneration see generally; The Maize Handbook, Freeling and Walbot, Eds., 
Springer, New York (1994); Com and Com Improvement, 3 rd edition, Sprague and 
Dudley Eds., American Society of Agronomy, Madison, Wisconsin (1988). 

One of skill will recognize that after the expression cassette is stably 
incorporated in transgenic plants and confirmed to be operable, it can be 
introduced into other plants by sexual crossing. Any of a number of standard 
breeding techniques can be used, depending upon the species to be crossed. 

In vegetatively propagated crops, mature transgenic plants can be 
propagated by the taking of cuttings or by tissue culture techniques to produce 
multiple identical plants. Selection of desirable transgenics is made and new 
varieties are obtained and propagated vegetatively for commercial use. In seed 
propagated crops, mature transgenic plants can be self crossed to produce a 
homozygous inbred plant. The inbred plant produces seed containing the newly 
introduced heterologous nucleic acid. These seeds can be grown to produce 
plants that would produce the selected phenotype. 

Parts obtained from the regenerated plant, such as flowers, seeds, leaves, 
branches, fruit, and the like are included in the invention, provided that these parts 
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comprise cells comprising the isolated nucleic acid of the present invention. 
Progeny and variants, and mutants of the regenerated plants are also included 
within the scope of the invention, provided that these parts comprise the 
introduced nucleic acid sequences. 
5 Transgenic plants expressing a selectable marker can be screened for 

transmission of the nucleic acid of the present invention by, for example, standard 
immunoblot and DNA detection techniques. Transgenic lines are also typically 
evaluated on levels of expression of the heterologous nucleic acid. Expression at 
the RNA level can be determined initially to identify and quantitate expression- 

10 positive plants. Standard techniques for RNA analysis can be employed and 
include PCR amplification assays using oligonucleotide primers designed to 
amplify only the heterologous RNA templates and solution hybridization assays 
using heterologous nucleic acid-specific probes. The RNA-positive plants can 
then be analyzed for protein expression by Western immunoblot analysis using the 

15 specifically reactive antibodies of the present invention. In addition, in situ 
hybridization and immunocytochemistry according to standard protocols can be 
done using heterologous nucleic acid specific polynucleotide probes and 
antibodies, respectively, to localize sites of expression within transgenic tissue. 
Generally, a number of transgenic lines are usually screened for the incorporated 

20 nucleic acid to identify and select plants with the most appropriate expression 
profiles. 

A preferred embodiment is a transgenic plant that is homozygous for the 
added heterologous nucleic acid; i.e., a transgenic plant that contains two added 
nucleic acid sequences, one gene at the same locus on each chromosome of a 

25 chromosome pair. A homozygous transgenic plant can be obtained by sexually 
mating (selfing) a heterozygous transgenic plant that contains a single added 
heterologous nucleic acid, germinating some of the seed produced and analyzing 
the resulting plants produced for altered expression of a polynucleotide of the 
present invention relative to a control plant (i.e., native, non-transgenic). Back- 

30 crossing to a parental plant and out-crossing with a non- transgenic plant are also 
contemplated. 

The present invention provides a method of genotyping a plant comprising 
a polynucleotide of the present invention. Genotyping provides a means of 
distinguishing homologs of a chromosome pair and can be used to differentiate 
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segregants in a plant population. Molecular marker methods can be used for 
phylogenetic studies, characterizing genetic relationships among crop varieties, 
identifying crosses or somatic hybrids, localizing chromosomal segments affecting 
monogenic traits, map based cloning, and the study of quantitative inheritance. 

5 See, e.g., Plant Molecular Biology: A Laboratory Manual, Chapter 7, Clark, Ed., 
Springer-Verlag, Berlin (1997). For molecular marker methods, see generally. The 
DNA Revolution by Andrew H. Paterson 1996 (Chapter 2) in: Genome Mapping in 
Plants (ed. Andrew H. Paterson) by Academic Press/R. G. Landis Company, 
Austin, Texas, pp.7-21 . 

10 The particular method of genotyping in the present invention may employ 

any number of molecular marker analytic techniques such as, but not limited to, 
restriction fragment length polymorphisms (RFLPs). RFLPs are the product of 
allelic differences between DNA restriction fragments caused by nucleotide 
sequence variability. Thus, the present invention further provides a means to 

15 follow segregation of a gene or nucleic acid of the present invention as well as 
chromosomal sequences genetically linked to these genes or nucleic acids using 
such techniques as RFLP analysis. 

Plants that can be used in the method of the invention include 
monocotyledonous and dicotyledonous plants. Preferred plants include com, 

20 soybean, sunflower, sorghum, canola, wheat, alfalfa, cotton, rice, barley, millet, 
Arabidopsis thaliana, tomato, Brassica vegetables, peppers, potatoes, apples, 
spinach, or lettuce. 

Seeds derived from plants regenerated from transformed plant cells, plant 
parts or plant tissues, or progeny derived from the regenerated transformed 
25 plants, may be used directly as feed or food, or further processing may occur. 

The present invention will be further described by reference to the following 
detailed examples. It is understood, however, that there are many extensions, 
variations, and modifications on the basic theme of the present invention beyond 
that shown in the examples and description, which are within the spirit and scope 
30 of the present invention. All publications, patents, and patent applications cited 
herein are hereby incorporated by reference. 
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EXAMPLES 

Identification of a phytyl/prenyltransferase involved in biosynthesis of 
tocopherols in Synechocystis PCC 6803 and Arabidopsis thaliana. 

PCC 6803 was used as a tool for identification of genes encoding 
enzymes involved in biosynthesis of tocopherols. Synechocystis is a 
cyanobacterium capable of tocopherol biosynthesis. The entire genome of this 
photosynthetic organism has been recently sequenced (Kaneko et al., 1996) and 
the data is available on a public searchable database, called CyanoBase 
(http://www.kazusa.or.jp/cyano/cyano.html). Using CyanoBase, we have identified 
an open reading frame (SLR1736) encoding a phytyl/prenyltransferase involved in 
the biosynthesis of 2-methyl-6-phytylplastoquinol, one of the tocopherol 
precursors. This open reading frame was identified based on similarity with the 
phytyl/prenyltransferase SLR0056, a phytyl/prenyltransferase involved in the 
biosynthesis of chlorophyll in Synechocystis PCC 6803. SLR0056 exhibits a high 
homology with the previously identified chlorophyllide/phytyl/prenyltransferases 
from many cyanobacteria and A. thaliana (Lopez et al., 1996), suggesting that this 
enzyme is also involved in chlorophyll synthesis. 

SLR1736 is similar, but not highly homologous to the SLR0056 open 
reading frame. However, the putative prenyl-binding domain is highly conserved 
in SLR1736. making it a good candidate for the tocopherol 
phytyl/prenyltransferase. Using the SLR1736 translated sequence as a query in 
the blast search, a genomic clone on chromosome II was identified in the A. 
thaliana database (Stanford Genomic Resources). This genomic clone was used 
to isolate an Arabidopsis cDNA clone. The F19F24 genomic clone and 
Arabidopsis cDNA are highly homologous to the SLR1736 protein sequence. The 
prenyl-binding domain is also conserved in the F19F24 and Arabidopsis cDNA. In 
addition, the amino terminal deduced amino acid sequence of the Arabidopsis 
gene and cDNA exhibits the traits of chloroplast-targeting sequences. Tocopherol 
biosynthesis has been shown to take place in chloroplast envelopes (Soil et al., 
1980; Soil. 1987). We believe that the Arabidopsis F19F24 gene and homologous 
cDNA represent the orthologous phytyl/prenyltransferase that attaches 
phytyldiphosphate (phytyl-PP) and/or geranylgeranyl pyrophosphate (GGPP) to 
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homogentisic acid in tocopherol synthesis in A. thatiana. Additionally, a 1 .2 kb 
corn EST, chste82, that is highly homologous to SLR1736 has been also identified 
in a blast search. 

To demonstrate that SLR1736 might be involved in tocopherol biosynthesis 

5 in Synechocystis, this gene was disrupted by insertion of the kanamycin 
expression cassette. The method of gene disruption by gene replacement 
technique has been previously described (Williams, 1988). The resulting mutant 
was named ASLR1736. Before analyses, the mutant was sub-cultured at least 6 
times by single colony section on kanamycin to select for the colonies containing 

10 only copies of the SLR1736 gene disrupted with the kanamycin resistance gene. 
The absence of wild type SLR 1736 genes was confirmed by PCR. The lack of 
tocopherols in the mutant was shown by HPLC separation of lipid extracts from 
wild type and mutant Synechocystis on a normal-phase column using fluorescent 
detection (FLD). Levels of phylloquinone (vitamin K1 ) and plastoquinone seem to 

15 be unaffected in this mutant. This suggests that there are at least two separate 
prenyltransferase activities for tocopherol and plastoquinone synthesis in 
Synechocystis and we may be able to manipulate carbon flow through the 
pathway by altering gene expression of either. Phytylation/prenylation of 
homogentisic acid is the branch-point in tocopherol and plastoquinone synthesis, 

20 and therefore, most likely an important regulatory point of the pathway. As well as 
the prenylation activities, availability of different prenyl tails may also be crucial for 
the regulation of carbon flow through the pathway. This will become significant for 
manipulating tocopherol levels in higher plants. 

25 Amplification of the SLR1736 open reading frame from Synechocystis 

Chromosomal DNA from wild type Synechocystis PCC 6803 was isolated 
according to Williams (Methods in Enzymology (1987) 167: 766-778). The 
following primers were designed using Mac Vector computer program to amplify a 
1.022 kb fragment containing the SLR1736 open reading frame: 



SLR1 736F: 5 , -TATTCATATGGCMCTATCCAAGCTTTTTG-3 , 
SLR1 736R: 5'-GGATCCTAATTGAAGAAGATACTAAATAGTTC-3' 
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Ndel and BamHI sites were added to the primers to facilitate sub-cloning for 
expression purposes. ATG in the SLR1736F primer is the start codon for the 
SLR1736 open reading frame published in the CyanoBase Web-site. Taq 
polymerase (Gibco BRL) was used for gene disruption purposes and later Vent 
polymerase (NEB) was used for expression purposes following the manufacturer's 
recommendations. The following cycles were performed: 
For Taq polymerase amplification: 
95 °C/5 minutes (1 cycle) 

95 °C/45 seconds, 45 °C/45 seconds, 68 °C/45 seconds (5 cycles) 
95 °C/45 seconds, 52 °C/45 seconds, 72 °C/45 seconds (30 cycles) 
72 °C/10 minutes 

The same thermocycler conditions were used to amplify SLR1736 with Vent 
polymerase except that elongation times were extended to 2 minutes. 

Sub-cloning the SRL1736 PCR product 

Plasmid pBluescript KS II (Stratagene) was digested with EcoRV (NEB) 
according to manufacturer's protocols. Both linearized pBluescript and the 
amplified SLR1736 open reading frame were separated by 0.9 % agarose TBE gel 
electrophoresis. The bands were excised and purified from the gel using a 
JetSorb DNA purification kit (PGC Scientifics). The purified fragment was sub- 
cloned into the EcoRV site of pBluescript KS II in a blunt-end ligation reaction. A 
10 y\ ligation reaction contained 20 mM Tris-HCI (pH 7.6), 5 mM MgCI 2 , 5 mM 
DTT, 50 fig/ml BSA, 0.5 mM rATP, 15 % PEG, and 1 U of T4 DNA ligase (Gibco 
BRL). Ligation was carried out at room temperature for 4 hours. One half of the 
reaction mixture was used to transform competent £ coli DH5a cells. 
Transformants were then selected on LB plates containing 100 mg/L of ampicillin. 
X-gal and IPTG were used for blue/white selection. White ampicillin resistant 
colonies were then selected, grown in liquid LB/ampicillin media, and plasmids 
were purified. The resulting plasmid was designated as KS-1736 and the nature 
and the orientation of the 1736 insert was determined by restriction digestion and 
sequencing (ABI Prism 310 Genetic Analyser). Clone #5, in which SLR1736 was 
in a reverse orientation to the Lac promoter of the vector, was selected for further 
manipulations. 
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The SLR1736 replacement construct 

Transformation followed by homologous recombination is feasible in 
Synechocystis (Williams, 1988). A gene of interest, in our case SLR1736, can be 
easily disrupted by inserting an antibiotic resistance gene into the coding region. 
5 Such a disruption construct can be transformed into Synechocystis. pBluescript 
KS or any other vector capable of replication in E. coli can be used as a vector. 
These vectors cannot be replicated by DNA replication machinery of 
Synechocystis so that the cells are forced to keep the resistance gene by other 
means when kept on the antibiotic selection. In Synechocystis, the wild type 

10 copies of the target gene are replaced with the copies of this gene disrupted with 
the antibiotic resistance cassette by homologous recombination. Since this 
cyanobacterium contains multiple copies of its genome, it is necessary to streak 
selected resistant colonies on the selection media several times. This should 
ensure replacement of the wild type copies of the gene with the disrupted ones 

15 (Williams, 1988). 

The kanamycin resistance gene from the transposon Tn903 encoding 
aminoglycoside 3'-phosphotransferase was used to disrupt the wild type SLR1736 
gene. Plasmid pUC4K (Pharmacia) was cut with EcoRI to release the kanamycin 
resistance expression cassette. Since SLR1736 has a unique Mfel site about 200 

20 bp from the beginning of the gene, plasmid KS-1736 #5 was digested with Mfel 
(NEB). Mfel leaves 5'-cohesive ends compatible with EcoRI so that no other 
molecular manipulations are necessary. The two DNA fragments were purified 
from agarose gels as described above and ligated using T4 DNA ligase (Gibco 
BRL) as recommended by the manufacturer. Competent £. coli DH5a cells were 

25 transformed with the ligation reaction and transformants selected on LB plates 
containing 50 mg of kanamycin per liter of media. Plasmids were purified and 
subjected to restriction analysis. Two plasmids having opposite orientation of the 
kanamycin cassette were chosen for Synechocystis transformation. The two 
constructs were designated as KSA1736-KAN-F and B, respectively, to indicate 

30 the orientation of the kanamycin resistance gene in respect to the SLR1736 gene. 

Transformation of Synechocystis PCC 6803 with KSA1736-KAN-F and B, 
respectively, was carried out as described by Williams (1988). Transformants 
were selected on BG-11 plates containing 15 mM glucose and 5 mg of kanamycin 
per liter of medium. Two independent colonies from each transformation were 
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then sub-cultured once a week for several weeks on BG-11 plates containing 15 
mM glucose and 15 mg/L kanamycin before being analyzed. The cells were 
grown under continuous light at 30°C. The resulting clones used for further 
analyses were designated ASLR1736 F-1, F-2, B-1, and B-2. 

5 

Confirmation of the SLR1736 gene disruption by PCR 

Chromosomal DNA from wild type and ASLR1736 mutant Synechocystis 
PCC 6803 was isolated from a few colonies according to Cai and Wolk (1990) with 
minor modifications as follows: Cells were resuspended in 200 jil of 50 mM 

10 Tris.HCI and 10 mM EDTA solution of pH 7.5. The cells were then transferred to a 
2 ml screw-cup tube. 10 \x\ of 20% SDS, 200 \il of phenolrchloroform (1:1), and 
white sand were added. The samples were mixed well by vortexing for 1 minute 
and then they were placed on ice for another minute. This step was repeated 
twice. The mixture was centrifuged at 14,000 rpm for 5 minutes to separate 

15 organic and aqueous layers. The top aqueous phase was then extracted twice 
with an equal volume of chloroform and precipitated with a quarter of volume of 
3M potassium acetate (pH 4.8) and two volumes of 96% of ethanol. After an hour 
incubation at -20°C and a ten-minute centrifugation at 14,000 rpm, genomic DNA 
was washed once with 80% of ethanol, dried in a speedvac for 2-3 minutes, and 

20 resuspended in 20 \i\ of water. DNA was diluted 1:10 with water and used as a 
template in PCR reactions. PCR was performed as described above using Taq 
polymerase (Gibco BRL). Insertion of the kanamycin cassette into the SLR1736 
open reading frame was clearly demonstrated. 

25 HPLC analyses of the lipid extracts from wild type and mutant 
Synechocystis. 

Tocopherol analysis 

About 30 mg of wild type and ASLR1736 F and B mutant cells grown on 
solid plates as described above were harvested and resuspended in 450 jil of 
30 methanol: chloroform (2:1) containing 1 mg/ml of butylated hydroxytoluene (BHT) 
to prevent oxidation of tocopherols. 200 ng of tocol was added as an internal 
standard. The cells were homogenized using mini-pestals followed by addition of 
150 mJ of chloroform and 300 \i\ of water to the mixture. After centrifugation (5 
minutes at 14,000 rpm), the lower organic phase was transferred to a clean 
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microfuge tube and dried in a speedvac. Lipids were resuspended in 80 jil of 
hexane containing 1 mg/ml of BHT. 40 \i\ of the lipid extract was subjected to 
HPLC (Hewlett-Packard 1100 Series HPLC system with a fluorescence detector) 
using a normal phase column (Lichrosorb Si60A 4.6 X 250 mm) equilibrated at 
42°C. A 20-minute linear gradient of 8 % to 18 % di-isopropyl ether in hexane was 
used to separate different types of tocopherols. After excitation at a wavelength of 
290 nm, tocopherols were detected by their fluorescence at 325 nm. 

Wild type Synechocystis accumulates predominantly a-tocopherol. The 
ASLR1736 disruption mutants lack all tocopherols and this effect is independent of 
the kanamycin cassette orientation. These results indicate that SLR1736 is 
involved in tocopherol biosynthesis and acts upstream of the methyltransferases. 
Disruption of the methyltransferase genes SLL0418 (2-methyl-6-phytylplastoquinol 
methyltransferase) and SLR0089 (y-tocopherol methyltransferase) which have 
been recently cloned from Synechocystis leads to the accumulation of p- and y- 
tocopherols, respectively (Shintani, D., personal communication; Shintani and 
DellaPenna, 1998). The only two possible remaining enzymes are the cyclase and 
prenyltransferase. Since SLR1736 exhibits a similarity to known 
prenyltransferases, we believe this enzyme represents a prenyltransferase. More 
conclusive proof than the one based on similarity would be given by in vitro 
prenyltransferase assays and feeding studies of wild type and ASLR1736 mutant 
Synechocystis with 14 C uniformly labeled tyrosine. 

Phylloquinone and plastoquinone analysis 

Formation of homogentisic acid, the first step of the pathway, is common for 
both tocopherols and plastoquinone in photosynthetic organisms. To answer the 
question if the tocopherol prenyltransferase is also involved in plastoquinone 
biosynthesis and how carbon flow is affected in the plastoquinone part of the 
pathway, we analyzed lipid extracts from wild type and the mutant cells. On the 
other hand, the phytyl tail is a part of vitamin K1 (phylloquinone) molecule. To 
estimate effects of the SLR1736 gene disruption on the phylloquinone 
biosynthesis in Synechocystis, we also performed vitamin K1 analysis. 

About 30 mg of wild type and ASLR1736 F and B mutant were harvested 
and resuspended in 450 \i\ of methanol: chloroform (2:1). The cells were 
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homogenized using mini-pestals followed by addition of 150 ^1 of chloroform and 
300 fil of water to the mixture. After centrifugation (5 minutes at 14,000 rpm) t the 
lower organic phase was transferred to a clean microfuge tube, dried in a 
speedvac, dissolved in 30 \d of ethyl acetate, and oxidized with silver oxide for a 
5 half an hour. The entire extract was loaded on a TLC plate (Silica, 60A) which 
was developed in 20% diethyl ether in petroleum ether and dried. The plate was 
sprayed with leucomethylene blue (Crane & Barr, 1971) to visualize any changes 
in quinone composition. No differences between the wild type and mutant quinone 
profiles were observed. To prepare leucomethylene blue, 50 mg of methylene 
10 blue and 0.5 g of zinc dust were mixed in 5 ml of water. The mixture was acidified 
with a few drops of concentrated sulfuric acid and left to react for about 10 minutes 
before use. 

To quantify possible changes in quinone content in wild type and mutant 
Synechocystis, HPLC analyses of lipid extracts containing plastoquinone-8 as an 

15 internal standard were performed. Lipids were extracted as described above 
except that 1 ng of plastoquinone-8 was added in the beginning of extraction. 
Plastoquinone-8 and plastoquinone-9 standards were isolated and purified from 
Iris holandica bulbs (Hutson & Therfall, 1980) and their concentrations were 
determined using the molar absorption coefficient of plastoquinone-9 at 254 nm, 

20 17.94 mM" 1 cm" 1 . These quinones have similar properties and they can be easily 
separated by the HPLC method described below. Therefore, plastoquinone-8 is 
an excellent internal standard. 

After extraction, quinones were resuspended in 80 ^ of HPLC grade ethyl 
acetate. 40 jxl of the lipid extract was subjected to HPLC (Hewlett-Packard 1 1 00 

25 Series HPLC system) using a C-18 reverse phase column (Spherisorb, 4.6 X 250 
mm). The following conditions were utilized to separate different quinones: 
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The flow of the solvents was 0.8 ml/min and the separation was performed 
5 at room temperature. Quinones were detected by their absorbance at 250 and 
275 nm using a diode array detector and the identity of phylloquinone and 
plastoquinone-9 was confirmed by comparison with their previously published 
spectra (Crane & Barr, 1971). No differences in vitamin K1 and plastoquinone-9 
compositions were observed between wild type and the ASLR1736 disruption 
10 mutant. This indicates that the SLR1736 gene product is involved only in 
tocopherol biosynthesis. 

Cloning phytyl transferase from A. thaliana involved in tocopherol 
biosynehesis 

15 A developing seed-specific cDNA library from A.thaliana (iambda-ZAP type, 

provided by John Ohlrogge at the Michigan State University) was screened using 
a PCR product from wild type A. thaliana genomic DNA (Ler ecotype) which 
exhibits a high degree of homology with the Synechocystis phytyl transferase. 
Primers AT1736F (5 , -TTGTTTTCAGGCTGTTGTTGCAGCTCTC-3 , ) and AT1736B 

20 (5M:GTTTCTGACCCAGAGTTACAGAGAATG-3') were used to amplify about 1 kb 
fragment corresponding to 60238 - 61229 bp region of the BAC clone F19F24 (A. 
thaliana database at Stanford). The following program was used to amplify this 
fragment with Vent DNA polymerase (New England Biolabs): 
95 °C/5 minutes (1 cycle) 

25 95 °C/45 seconds; 50 °C/45 seconds, ( 72 °C/1 minute (30 cycles) 
72 °C/10 minutes (1 cycle) 

The PCR product was then sub-cloned into EcoRV site of pBluescript KS 
(Stratagene) as in the case of the cyanobacterial phytyl transferase presented 
above and sequenced from both ends using T3 and T7 primers (Stratagene) to 
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ensure the identity of the sub-cloned fragment. A 300 bp fragment of the insert 
(5'-end) was released with EcoRI from the vector and used as a radioactively- 
labeled probe to obtain full-length clones. About 2.5 million plaques of the seed- 
specific library were screened using standard procedures (Sambrook, J., Fritsch. 
E.F. & Maniatis. T. (1989). Molecular Cloning. 2 nd edition, Cold Spring Harbor 
Laboratory Press). 16 positive non-purified plaques were chosen for PCR 
analysis using T3 and AT1736T7c (5-GACATATTTTTGCAGTCTGCC-3) which is 
an internal primer for the phytyl transferase. Clones #1, 3. 5, 8, 11, 12, and 14 
were selected for further purification and single clone excision, performed 
according to manufacturer (Stratagene), to obtain individual clones in pBluescript 
SK plasmids. Each clone was sequenced from each end using T3 and T7 primer. 
The longest clone. #11 - about 1.6 kb, was chosen for complete sequencing 
which is in progress now. All clones were aligned to the genomic clone F19F24 
from A. thaliana to confirm their identity, identify introns and find possible 
sequencing mistakes in the genomic sequence. We believe that ATG codon 
(59220 bp on F19F24) is the start codon of the phytyl transferase involved in 
tocopherol synthesis in A. thaliana. Starting from this methionine, the first 36 
amino acids represent the chloroplast thyfakoid membrane-targeting sequence 
(PSORT program, hhtp://psort.nibb.ac.jp:8800/). 

Confirmation of prenyltransferase nature of SLR1736 

To confirm the prenyltransferase nature of SLR1736, the intact gene will be 
expressed in E. coli because this bacterium lacks any enzymatic activity 
connected to tocopherol biosynthesis. Therefore, SLR1736 activity will be shown 
by an in vitro phytyl/prenyltransferase assay using protein extracts from E. coli 
expressing SLR1736 or by reconstruction of multiple steps of the pathway in E. 
coli. 14 C uniformly labeled p-hydroxyphenyl pyruvate and phytyl-PP, or other 
prenyl diphosphates will be used as substrates, p-hydroxyphenyl pyruvate 
dioxygenase catalyses conversion of p-hydroxyphenylpyruvic acid to homogentisic 
acid, the immediate substrate for the tocopherol and plastoquinone 
prenyltransferase(s). Therefore, A. thaliana p-hydroxyphenylpyruvic acid 
dioxygenase (Norris er a/., 1998) expressed in E. coli along with the 
prenyltransferase will be present in the reactions to couple the two enzymatic 
steps. To further show that SLR1736 is a prenyltransferase, ASLR1736 and wild 



WO 00/68393 PCT/US00/1 1439 

-41 - 

type Synechocystis will be grown in the presence of 14 C uniformly labeled L- 
tyrosine to trace prenylated products by using TLC and autoradiography. 

The SLR1736 open reading frame will be also expressed in E. coli in the 
presence of p-hydroxyphenylpyruvic acid dioxygenase (Norris et a/. t 1998), Adonis 

5 paleastina geranylgeranyl diphosphate synthase (gift from F. Cunningham), and 
geranylgeranyl hydrogenase from Synechocystis (SLL1091, Addlesee et al., 1996; 
Keller et a/ M 1998) to reconstitute the phytyl pyrophosphate pathway since E. coli 
does not possess any of these enzymatic activities. Lipids will be extracted and 
subjected to HPLC analysis of quinones as described above. 2-methyl-6- 

10 phytylplastoquinone is stable and should be present in E. coli lipid extracts. 

SLR1736 homologue from A. thaliana (AT1736) 

To investigate the role of the plant homologue of SLR1736, the intact full 
length cDNA from Arabidopsis thaliana (AT1736) and com chste82 EST will be 

15 expressed in the sense and antisense orientation from the constitutive CaMV 35S 
or seed-specific (Seffens et al M 1990) promoters, respectively, in A, thaliana. 
Visible phenotype(s) will be observed and lipids from the transgenic plants will be 
extracted and subjected to HPLC/FLD analyses to detect changes in tocopherol 
content and composition in green tissues and seeds. Plastoquinone and 

20 phyloquinone levels will also be analyzed as described above. It is possible that 
phytyl-PP is limiting for the prenyltransferase activity. Consequently, we may want 
to overexpress geranylgeranyl pyrophosphate synthase and GGDP dehyrogenase 
to elevate phytl-PP levels in A. thaliana. Columbia ecotype Arabidopsis plants will 
be transformed with these overexpression constructs separately and homozygous 

25 transformants will be crossed to obtain plants containing all three constructs. 

The in vitro prenyltransferase assay will be performed with AT1736 
expressed in £. coli as described above for SLR1736. Prenyl tail-specificity 
studies will be also carried out with this enzyme, using various tails such as 
GGDP, phytyl-PP, and solanyl-PP. As in the case of SLR1736 from 

30 Synechocystis, it is important to determine if there are one or two 
prenyltransferases for tocopherol and plastoquinone biosynthesis in higher plants. 
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Construction of p0018 Maize cDNA libraries 



PCT/US00/11439 



Total RNA Isolation 

Total RNA was isolated from p0018 library com tissues with TRIzol 
Reagent (Life Technology Inc. Gaithersburg. MD) using a modification of the 
guanidine isothiocyanate/acid-phenol procedure described by Chomczynski and 
Sacchi (Chomczynski. P., and Sacchi, N. Anal. Biochem. 162. 156 (1987)). In 
brief, plant tissue samples were pulverized in liquid nitrogen before the addition of 
the TRIzol Reagent, and then were further homogenized with a mortar and pestle. 
Addition of chloroform followed by centrifugation was conducted for separation of 
an aqueous phase and an organic phase. The total RNA was recovered by 
precipitation with isopropyl alcohol from the aqueous phase. 

Poly(A)+ RNA Isolation 

The selection of poly(A)+ RNA from total RNA was performed using 
PolyATact system (Promega Corporation. Madison, Wl). In brief, biotinylated 
oligo(dT) primers were used to hybridize to the 3' poly(A) tails on mRNA. The 
hybrids were captured using streptavidin coupled to paramagnetic particles and a 
magnetic separation stand. The mRNA was washed at high stringent condition 
and eluted by RNase-free deionized water. 

cDNA Library Construction 

cDNA synthesis was performed and unidirectional cDNA libraries were 
constructed using the Superscript Plasmid System (Life Technology Inc. 
Gaithersburg, MD). The first stand of cDNA was synthesized by priming an 
oligo(dT) primer containing a Not I site. The reaction was catalyzed by 
Superscript Reverse Transcriptase II at 45°C. The second strand of cDNA was 
labeled with alpha-32P-dCTP and a portion of the reaction was analyzed by 
agarose gel electrophoresis to determine cDNA sizes. cDNA molecules smaller 
than 500 base pairs and unligated adapters were removed by Sephacryl-S400 
chromatography. The selected cDNA molecules were ligated into pSPORTI 
vector in between of Not I and Sal I sites. 
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Sequencing of Maize cDNA and Library Subtraction 

Sequencing Template Preparation 

Individual colonies were picked and DNA was prepared either by PCR with 
M13 forward primers and M13 reverse primers, or by plasmid isolation. All the 
5 cDNA clones were sequenced using M13 reverse primers. 

Q-bot Subtraction Procedure 

cDNA libraries subjected to the subtraction procedure are plated out on 22 
x 22 cm2 agar plate at density of about 3,000 colonies per plate. The plates are 

10 incubated in a 37°C incubator for 12-24 hours. Colonies are picked into 384-weII 
plates by a robot colony picker, Q-bot (GENETIX Limited). These plates are 
incubated overnight at 37°C. 

Once sufficient colonies are picked, they were pinned onto 22 x 22 cm2 
nylon membranes using Q-bot. Each membrane contained 9,216 colonies or 

15 36,864 colonies. These membranes are placed onto agar plate with appropriate 
antibiotic. The plates are incubated at 37°C for overnight. 

After colonies are recovered on the second day, these filters are placed on 
filter paper prewetted with denaturing solution for four minutes, then are incubated 
on top of a boiling water bath for additional four minutes. The filters are then 

20 placed on filter paper prewetted with neutralizing solution for four minutes. After 
excess solution is removed by placing the filters on dry filter papers for one 
minute, the colony side of the filters are place into Proteinase K solution, 
incubated at 37oC for 40-50 minutes. The filters are placed on dry filter papers to 
dry overnight. DNA is then cross-linked to nylon membrane by UV light treatment. 

25 Colony hybridization is conducted as described by Sambrook,J. f Fritsch, 

E.F. and Maniatis, T., (in Molecular Cloning: A laboratory Manual, 2nd Edition). 
The following probes were used in colony hybridization: 

1. First strand cDNA from the same tissue as the library was made from to 
remove the most redundant clones. / 
30 2. 48-192 most redundant cDNA clones from the same library based on previous 
sequencing data. 

3. 192 most redundant cDNA clones in the entire com sequence database. 

4. A Sal-A20 oligo nucleotide: TCG ACC CAC GCG TCC GAA AAA AAA AAA 
AAA AAA AAA, removes clones containing a poly A tail but no cDNA. 
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The image of the autoradiography is scanned into computer and the signal 
intensity and cold colony addresses of each colony is analyzed. Re-arraying of 
cold-colonies from 384 well plates to 96 well plates is conducted using Q-bot. 

Identification of Gene from a Computer Homology Search 

Gene identities were determined by conducting BLAST (Basic Local 
Alignment Search Tool; Altschul, S. F. ( et a/., (1993) J. Mol. Biol. 215:403-410; 
see also www.ncbi.nlm.nih.gov/BLAST/) searches under default parameters for 
similarity to sequences contained in the BLAST "nr" database (comprising all non- 
redundant GenBank CDS translations, sequences derived from the 3-dimensional 
structure Brookhaven Protein Data Bank, the last major release of the SWISS- 
PROT protein sequence database, EMBL, and DDBJ databases). The cDNA 
sequences were analyzed for similarity to all publicly available DNA sequences 
contained in the "nr" database using the BLASTN algorithm provided by the 
National Center for Biotechnology Information (NCBI). The DNA sequences were 
translated in all reading frames and compared for similarity to all publicly available 
protein sequences contained in the M nr" database using the BLASTX algorithm 
(Gish, W. and States, D. J. Nature Genetics 3:266-272 (1993)) provided by the 
NCBI. In some cases, the sequencing data from two or more clones containing 
overlapping segments of DNA were used to construct contiguous DNA 
sequences. 
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Construction of Additional Maize, Rice, Soybean and Wheat cDNA libraries 



Composition of cDNA Libraries; Isolation and Sequencing of cDNA Clones 

cDNA libraries representing mRNAs from maize, rice, soybean and wheat 
5 tissues were prepared. The characteristics of these libraries are described in 



Table 1. 



TABLE 1 


Library 
Designation 


Library Description 


Clone 


ceo In 


Corn Cob of 67 Day Old Plants Grown in Green House* 


ccoln.pk087.117 


p0018 


Seedling after 10 day drought (T001), heat shocked for 24 hrs (T002), 
recovery at normal growth condition for 8 hrs, 16 hrs, 24hrs 


p0018.chste82r:fis 


p0108 


PR leaves + C.carbonium, screened 1 Pool of PR+C. carbonium tox-3h; 
PR+C. carbonium tox-6h; PR+C. carbonium tox-24h; PR+C. carbonium 
tox-48hr; and PR+C. carbonium tox-7 7 days 


p0108.cjrrnc89nfis 


rcaln 


Rice (Oryza sativa L., Nipponbare) callus normalized. 


rcaln.pk025.c4 


rlOn 


Rice 15 Day Old Leaf* 


rl0n.pk0066.e2:fis 


scrlc 


Soybean (Glycine max L., 2872) Embryogenic suspension culture subjected 
to 4 vacuum cycles and collected 12 hrs later (control scblc). 


scrlc.pk005.12 


sgc7c 


Soybean (Glycine max L., Wye) germanating cotyledon (yellow and 
wilting; 18-30 DAG). 


Sgc7c.pk001.h22 


src2c 


Soybean (Glycine max L., 437654) 8 day old root inoculated with eggs of 
cyst Nematode (Race 1) for 4 days. 


src2c.pk020.d5:fis 


wdk2c 


Wheat Developing Kernel, 7 Days After Anthesis. 


wdk2c.pk012.f2 


wlmO 


Wheat Seedlings 0 Hour After Inoculation With Erysiphe graminis / sp 
tritici 


wlm0.pk001l.c7 



♦These libraries were normalized essentially as described in U.S. Pat. No. 5,482,845, incorporated herein by 
reference. 

10 

In general, cDNA libraries may be prepared by the method described above 
or by any one of many other methods available. For example, the cDNAs may be 
introduced into plasmid vectors by first preparing the cDNA libraries in Uni-ZAP™ 
XR vectors according to the manufacturer's protocol (Stratagene Cloning 

15 Systems, La Jolla, CA). The Uni-ZAP™ XR libraries are converted into plasmid 
libraries according to the protocol provided by Stratagene. Upon conversion, 
cDNA inserts will be contained in the plasmid vector pBluescript. In addition, the 
cDNAs may be introduced directly into precut Bluescript II SK(+) vectors 
(Stratagene) using T4 DNA ligase (New England Biolabs), followed by transfection 

20 into DH10B cells according to the manufacturer's protocol (GIBCO BRL Products). 
Once the cDNA inserts are in plasmid vectors, plasmid DNAs are prepared from 
randomly picked bacterial colonies containing recombinant pBluescript plasmids, 
or the insert cDNA sequences are amplified via polymerase chain reaction using 
primers specific for vector sequences flanking the inserted cDNA sequences. 
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Amplified insert DNAs or plasmid DNAs are sequenced in dye-primer sequencing 
reactions to generate partial cDNA sequences (expressed sequence tags or 
"ESTs"; see Adams et al., (1991) Science 252:1651-1656). The resulting ESTs 
are analyzed using a Perkin Elmer Model 377 fluorescent sequencer. 

Characterization of cDNA Clones Encoding Phytvl/prenvltransferase. 
cDNA Clones were identified by computer homology search as described above. 
The BLASTP and BLASTN searches using the sequences from clones listed in 
Table 1 revealed similarity to certain polypeptides as shown in Table 2. The 
7blast/data/2.0/2/nr* database was searched. GAP results showing % identity to 
synechocystis and arabidopsis are also shown. Table 2 shows the BLAST results 
for individual complete gene sequences ( tt CGS w ). 
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TABLE 2 

Top BLAST Results for Sequences Encoding Polypeptides Homologous 
to Phytyl/prenyltransferase and GAP % Identity to Synechocystis and Arabidopsis 



Clone 


Status 


Protein Sequence with Significant 
Alignment 
gi# (accession #) Organism; % Blast 
Identity 


GAP % 
Identity 

Clone to 


GAP % 
Identity 

Clone to 
At-OUJO to 


SEQ ID 12 - 
Contig of: 

ccoln.pk087.1 17and 
cen3n.pk0012.h6 


CGS 


1652856 (D90909) Synechocystis; 36% 
3004556 (AC003673) Arabidopsis; 32% 


36.80% 


50.27% 


SEQ ID 4 - 
pO018xhstc82nfis 


CGS 


1652856 (D90909) Synechocystis; 43% 
3004556 (AC003673) Arabidopsis; 47% 


43.58% 


70.67% 


SEQ ID 14 - 
p0t08.cjrmc89rfis 


CGS 


1 652856 (D90909) Synechocystis; 36% 
6015890 (Y18930) Sulfolobus; 30% 
5103549 (AP00O058) Aeropyrum; 32% 
3004556 (AC003673) Arabidopsis; 26% 


37.54% 


30.30% 


SEQ ID 16 - 
rcaln.pk025.c4 


CGS 


1652856 (D90909) Synechocystis; 45% 
3004556 (AC003673) Arabidopsis; 43% 


45.27% 


70,67% 


SEQ ID18- 

rl0n.pk0066.e2:fi$ 




1652856 (D90909) Synechocystis; 35% 
5103549 (AP000058) Aeropyrum; 29% 
6015890 (Y18930) Sulfolobus; 28% 
3004556 (AC003673) Arabidopsis; 25% 


35.59% 


"> 1 1 £0/ 

31.1 6% 


SEQ ID 20 - 
scrlc.pk005.12 


CGS 


1652856 (D90909) Synechocystis; 37% 
6015890 (Y18930) Sulfolobus; 28% 
3004556 (AC003673) Arabidopsis; 25% 


36.95% 


33.33% 


SEQ ID 22 - 
Contig of: 
Sgc7c.pk00!.h22 


CGS 


1652856 (D90909) Synechocystis; 44% 
3004556 (AC003673) Arabidopsis; 83% 


44.90% 


75.48% 


SEQ ID 24 - 
src2c.pk020.d5:fis 


CGS 


3004556 (AC003673) Arabidopsis; 39% 
1652856 (D90909) Synechocystis; 29% 


30.51% 


52.62% 


SEQ ID 26 - 
Contig of: 
wdk2c.pk012.f2 


Partial 
Gene 
Seq 


3004556 (AC003673) Arabidopsis; 45% 
1652856 (D90909) Synechocystis; 37% 


37.50% 


43.75% 


SEQ ID 28- 
Contig of: 
wim0.pk00n.c7 


CGS 


1652856 (D90909) Synechocystis; 36% 
3004556 (AC003673) Arabidopsis; 27% 


37.54% 


30.30% 



Sequence alignments and BLAST sequence identities indicate that the 
nucleic acid fragments comprising the instant cDNA clones encode a substantial 
portion of a phytyl/prenyltransferase. 
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A BLASTN search of 7blast/data/2.0/3/est" using the sequences from 
clones listed in Table 1 showed homology (E score > 140) to the following 
sequences on Table 3 as indicated by Genbank Accession number. 

TABLE 3 - GAP Search Result 



■fS^umbfei^^ 


^Species ^ 


^howsfHomoloqyWith:SeqiD^Nd 


C25006 


Rice 


17 


C74444 


Rice 


15 


AA750728 


Rice 


15 


AA749638 


Rice 


15 


AU029707 


Rice 


15 


AI612332 


Com 


13 


AI711952 


Com 


13 


AI795680 


Corn 


13 * 


AI897027 


Tomato 


21 


AI938270 


Soybean 


21 


AI938569 


Soybean 


21 






23 


AI948381 


Com 


13 


AW052841 


Com 


13 


AW054141 


Com 


11 


AW066179 


Com 


11 


AW146615 


Com 


13, 






17 


AW202246 


Soybean 


19 


AI444024 


Soybean 


19 


AI442111 


Soybean 


19 


AW1 32909 


Soybean 


23 






21 


AI748688 


Soybean 


21 


AI939002 


Soybean 


19 


AW306617 


Soybean 


21 


AW433064 


Soybean 


21 


AW563431 


Sorghum 


17 



In sequencing clone containing SEQ ID NO: 11, an extra nucleotide at nt 
631 was observed. In addition, possible frameshifts at nt 107-140 were located 
that may interrupt homology to the Synechocystis hypothetical protein # 1652856. 
The extra nucleotide at nt 631 was deleted from the sequence listing provided with 
this application, and sequence identity was determined without considering the 
extra nucleotide. The extra nucleotide is likely an artifact occurring during the 
isolation and sequencing of the cDNA clones. 
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Clones p0108.cjrmc89nfis and r10n.pk0066.e2:fis each contain substantially 
complete gene sequence, with the exception of a few N-terminal amino acids on 
each. 

5 Clone wdk2c.pk012.f2 has an apparent intron from nt 322 to 426, as 

determined by GT/AG intron borders, that interrupt homology to 
p0018.chste82r:fis> The sequence listing provides both the nucleotide sequence 
of the clone with the intron (SEQ ID NO: 29) and without the intron (SEQ ID NO: 
25). Amino acid sequence identity in SEQ ID NO: 26 was determined after 

10 removal of the intron. 

The amino acid sequence of clone wlmO.pkOOl 1 .c7 covers the entire 
phytyl/prenyltransferase and contains a putative transit peptide sequence. 

15 Expression of Chimeric Genes in Monocot Cells 

A chimeric gene comprising a cDNA encoding the instant polypeptides in 
sense orientation with respect to the maize 27 kD zein promoter that is located 5' 
to the cDNA fragment, and the 10 kD zein 3' end that is located 3' to the cDNA 
fragment, can be constructed. The cDNA fragment of this gene may be generated 

20 by polymerase chain reaction (PCR) of the cDNA clone using appropriate 

oligonucleotide primers. Cloning sites (Ncol or Smal) can be incorporated into the 
oligonucleotides to provide proper orientation of the DNA fragment when inserted 
into the digested vector pML103 as described below. Amplification is then 
performed in a standard PCR. The amplified DNA is then digested with restriction 

25 enzymes Ncol and Smal and fractionated on an agarose gel. The appropriate 
band can be isolated from the gel and combined with a 4.9 kb Ncol-Smal 
fragment of the plasmid pML103. Plasmid pML103 has been deposited under the 
terms of the Budapest Treaty at ATCC (American Type Culture Collection, 10801 
University Blvd., Manassas, VA 201 10-2209), and bears accession number 

30 ATCC 97366. The DNA segment from pML103 contains a 1.05 kb Sall-Ncol 
promoter fragment of the maize 27 kD zein gene and a 0.96 kb Smal-Sall 
fragment from the 3' end of the maize 10 kD zein gene in the vector pGem9Zf(+) 
(Promega). Vector and insert DNA can be ligated at 15°C overnight, essentially 
as described (Maniatis). The ligated DNA may then be used to transform E. coli 
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XL1-Blue (Epicurian Coli XL-1 Blue™; Stratagene). Bacterial transformants can 
be screened by restriction enzyme digestion of plasmid DNA and limited 
nucleotide sequence analysis using the dideoxy chain termination method 
(Sequenase™ DNA Sequencing Kit; U.S. Biochemical). The resulting plasmid 
construct would comprise a chimeric gene encoding, in the 5' to 3' direction, the 
maize 27 kD zein promoter, a cDNA fragment encoding the instant polypeptides, 
and the 10 kD zein 3' region. 

The chimeric gene described above can then be introduced into corn cells by 
the following procedure. Immature com embryos can be dissected from 
developing caryopses derived from crosses of the inbred com lines H99 and 
LH132. The embryos are isolated 10 to 1 1 days after pollination when they are 
1.0 to 1.5 mm long. The embryos are then placed with the axis-side facing down 
and in contact with agarose-solidified N6 medium (Chu etal. (1975) Sci. Sin. 
Peking 18:659-668). The embryos are kept in the dark at 27°C. Friable 
embryogenic callus consisting of undifferentiated masses of cells with somatic 
proembryoids and embryoids borne on suspensor structures proliferates from the 
scutellum of these immature embryos. The embryogenic callus isolated from the 
primary explant can be cultured on N6 medium and sub-cultured on this medium 
every 2 to 3 weeks. 

The plasmid, p35S/Ac (obtained from Dr. Peter Eckes, Hoechst Ag, 
Frankfurt, Germany) may be used in transformation experiments in order to 
provide for a selectable marker. This plasmid contains the Pat gene (see 
European Patent Publication 0 242 236) which encodes phosphinothricin acetyl 
transferase (PAT). The enzyme PAT confers resistance to herbicidal glutamine 
synthetase inhibitors such as phosphinothricin. The pat gene in p35S/Ac is under 
the control of the 35S promoter from Cauliflower Mosaic Virus (Odell et al. (1985) 
Nature 313:810-812) and the 3" region of the nopaline synthase gene from the 
T-DNA of the Ti plasmid of Agrobacterium tumefaciens. 

The particle bombardment method (Klein et al. (1987) Nature 327:70-73) 
may be used to transfer genes to the callus culture cells. According to this 
method, gold particles (1 urn in diameter) are coated with DNA using the following 
technique. Ten ug of plasmid DNAs are added to 50 fiL of a suspension of gold 
particles (60 mg per ml_). Calcium chloride (50 ul_ of a 2.5 M solution) and 
spermidine free base (20 of a 1 .0 M solution) are added to the particles. The 
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suspension is vortexed during the addition of these solutions. After 10 minutes, 
the tubes are briefly centrifuged (5 sec at 15,000 rpm) and the supernatant 
removed. The particles are resuspended in 200 \iL of absolute ethanol, 
centrifuged again and the supernatant removed. The ethanol rinse is performed 

5 again and the particles resuspended in a final volume of 30 \xL of ethanol. An 
aliquot (5 \iL) of the DNA-coated gold particles can be placed in the center of a 
Kapton™ flying disc (Bio-Rad Labs). The particles are then accelerated into the 
com tissue with a Biolistic™ PDS-1000/He (Bio-Rad Instruments, Hercules CA), 
using a helium pressure of 1000 psi, a gap distance of 0.5 cm and a flying 

10 distance of 1.0 cm. 

For bombardment, the embryogenic tissue is placed on filter paper over 
agarose-solidified N6 medium. The tissue is arranged as a thin lawn and covered 
a circular area of about 5 cm in diameter. The petri dish containing the tissue can 
be placed in the chamber of the PDS-1000/He approximately 8 cm from the 

15 stopping screen. The air in the chamber is then evacuated to a vacuum of 

28 inches of Hg. The macrocarrier is accelerated with a helium shock wave using 
a rupture membrane that bursts when the He pressure in the shock tube reaches 
1000 psi. 

Seven days after bombardment the tissue can be transferred to N6 medium 
20 that contains gluphosinate (2 mg per liter) and lacks casein or proline. The tissue 
continues to grow slowly on this medium. After an additional 2 weeks the tissue 
can be transferred to fresh N6 medium containing gluphosinate. After 6 weeks, 
areas of about 1 cm in diameter of actively growing callus can be identified on 
some of the plates containing the glufosinate-supplemented medium. These calli 
25 may continue to grow when sub-cultured on the selective medium. 

Plants can be regenerated from the transgenic callus by first transferring 
clusters of tissue to N6 medium supplemented with 0.2 mg per liter of 2,4-D. After 
two weeks the tissue can be transferred to regeneration medium (Fromm et al. 
(1 990) Bio/Technology 8:833-839). 

30 

Expression of Chimeric Genes in Dicot Cells 

A seed-specific expression cassette composed of the promoter and 
transcription terminator from the gene encoding the p subunit of the seed storage 
protein phaseolin from the bean Phaseolus vulgaris (Doyle et al. (1986) J. Biol. 
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Chem. 261 :9228-9238) can be used for expression of the instant polypeptides in 
transformed soybean. The phaseolin cassette includes about 500 nucleotides 
upstream (5') from the translation initiation codon and about 1650 nucleotides 
downstream (3') from the translation stop codon of phaseolin. Between the 5' and 
3' regions are the unique restriction endonuclease sites Nco I (which includes the 
ATG translation initiation codon), Sma I, Kpn I and Xba I. The entire cassette is 
flanked by Hind III sites. 

The cDNA fragment of this gene may be generated by polymerase chain 
reaction (PCR) of the cDNA clone using appropriate oligonucleotide primers. 
Cloning sites can be incorporated into the oligonucleotides to provide proper 
orientation of the DNA fragment when inserted into the expression vector. 
Amplification is then performed as described above, and the isolated fragment is 
inserted into a pUC18 vector carrying the seed expression cassette. 

Soybean embryos may then be transformed with the expression vector 
comprising sequences encoding the instant polypeptides. To induce somatic 
embryos, cotyledons, 3-5 mm in length dissected from surface sterilized, immature 
seeds of the soybean cultivar A2872, can be cultured in the light or dark at 26°C 
on an appropriate agar medium for 6-10 weeks. Somatic embryos which produce 
secondary embryos are then excised and placed into a suitable liquid medium. 
After repeated selection for clusters of somatic embryos which multiplied as early, 
globular staged embryos, the suspensions are maintained as described below. 

Soybean embryogenic suspension cultures can maintained in 35 mL liquid 
media on a rotary shaker, 150 rpm, at 26°C with florescent lights on a 16:8 hour 
day/night schedule. Cultures are subcultured every two weeks by inoculating 
approximately 35 mg of tissue into 35 mL of liquid medium. 

Soybean embryogenic suspension cultures may then be transformed by the 
method of particle gun bombardment (Klein et al. (1987) Nature (London) 
327:70-73, U.S. Patent No. 4,945,050). A DuPont Biolistic™ PDS1000/HE 
instrument (helium retrofit) can be used for these transformations. 

A selectable marker gene which can be used to facilitate soybean 
transformation is a chimeric gene composed of the 35S promoter from Cauliflower 
Mosaic Virus (Odell et al. (1985) Nature 373:810-812), the hygromycin 
phosphotransferase gene from plasmid pJR225 (from £. co//; Gritz et al.(1983) 
Gene 25:179-188) and the 3' region of the nopaline synthase gene from the 
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T-DNA of the Ti plasmid of Agrobacterium tumefaciens. The seed expression 
cassette comprising the phaseolin 5' region, the fragment encoding the instant 
polypeptides and the phaseolin 3' region can be isolated as a restriction fragment. 
This fragment can then be inserted into a unique restriction site of the vector 
carrying the marker gene. 

To 50 \xL of a 60 mg/mL 1 \xm gold particle suspension is added (in order): 
5 nL DNA (1 ^g/nL), 20 nl spermidine (0.1 M), and 50 *iL CaCl2 (2.5 M). The 
particle preparation is then agitated for three minutes, spun in a microfuge for 
10 seconds and the supernatant removed. The DNA-coated particles are then 
washed once in 400 jxL 70% ethanol and resuspended in 40 fiL of anhydrous 
ethanol. The DNA/particle suspension can be sonicated three times for 
one second each. Five \xL of the DNA-coated gold particles are then loaded on 
each macro carrier disk. 

Approximately 300-400 mg of a two-week-old suspension culture is placed in 
an empty 60x15 mm petri dish and the residual liquid removed from the tissue with 
a pipette. For each transformation experiment, approximately 5-10 plates of tissue 
are normally bombarded. Membrane rupture pressure is set at 1100 psi and the 
chamber is evacuated to a vacuum of 28 inches mercury. The tissue is placed 
approximately 3.5 inches away from the retaining screen and bombarded three 
times. Following bombardment, the tissue can be divided in half and placed back 
into liquid and cultured as described above. 

Five to seven days post bombardment, the liquid media may be exchanged 
with fresh media, and eleven to twelve days post bombardment with fresh media 
containing 50 mg/mL hygromycin. This selective media can be refreshed weekly. 
Seven to eight weeks post bombardment, green, transformed tissue may be 
observed growing from untransformed, necrotic embryogenic clusters. Isolated 
green tissue is removed and inoculated into individual flasks to generate new, 
clonally propagated, transformed embryogenic suspension cultures. Each new 
line may be treated as an independent transformation event. These suspensions 
can then be subcultured and maintained as clusters of immature embryos or 
regenerated into whole plants by maturation and germination of individual somatic 
embryos. 
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Expression of Chimeric Genes in Microbial Cells 

The cDNAs encoding the instant polypeptides can be inserted into the T7 
E. coli expression vector pBT430. This vector is a derivative of pET-3a 

5 (Rosenberg et al. (1987) Gene 56:125-135) which employs the bacteriophage T7 
RNA polymerasefT7 promoter system. Plasmid pBT430 was constructed by first 
destroying the EcoR I and Hind III sites in pET-3a at their original positions. An 
oligonucleotide adaptor containing EcoR I and Hind III sites was inserted at the 
BamH I site of pET-3a. This created pET-3aM with additional unique cloning sites 

10 for insertion of genes into the expression vector. Then, the Nde I site at the 
position of translation initiation was converted to an Nco I site using 
oligonucleotide-directed mutagenesis. The DNA sequence of pET-3aM in this 
region, 5'-CATATGG ( was converted to 5'-CCCATGG in pBT430. 

Plasmid DNA containing a cDNA may be appropriately digested to release a 

15 nucleic acid fragment encoding the protein. This fragment may then be purified 
on a 1 % NuSieve GTG™ low melting agarose gel (FMC). Buffer and agarose 
contain 10 fig/ml ethidium bromide for visualization of the DNA fragment. The 
fragment can then be purified from the agarose gel by digestion with GELase™ 
(Epicentre Technologies) according to the manufacturer's instructions, ethanol 

20 precipitated, dried and resuspended in 20 \iL of water. Appropriate 

oligonucleotide adapters may be ligated to the fragment using T4 DNA ligase 
(New England Biolabs, Beverly, MA). The fragment containing the ligated 
adapters can be purified from the excess adapters using low melting agarose as 
described above. The vector pBT430 is digested, dephosphorylated with alkaline 

25 phosphatase (NEB) and deproteinized with phenol/chloroform as described 

above. The prepared vector pBT430 and fragment can then be ligated at 16°C for 
15 hours followed by transformation into DH5 electrocompetent cells (GIBCO 
BRL). Transformants can be selected on agar plates containing LB media and 
100 \iglmL ampicillin. Transformants containing the gene encoding the instant 

30 polypeptides are then screened for the correct orientation with respect to the T7 
promoter by restriction enzyme analysis. 

For high level expression, a plasmid clone with the cDNA insert in the correct 
orientation relative to the T7 promoter can be transformed into E. coli strain 
BL21(DE3) (Studier et al. (1986) J. MoL Biol, 789:1 13-130). Cultures are grown in 
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LB medium containing ampicillin (100 mg/L) at 25°C. At an optical density at 
600 nm of approximately 1, IPTG (isopropylthio-p-galactoside, the inducer) can be 
added to a final concentration of 0.4 mM and incubation can be continued for 3 h 
at 25°. Cells are then harvested by centrifugation and re-suspended in 50 ^iL of 
50 mM Tris-HCI at pH 8.0 containing 0.1 mM DTT and 0.2 mM phenyl 
methylsulfonyl fluoride. A small amount of 1 mm glass beads can be added and 
the mixture sonicated 3 times for about 5 seconds each time with a microprobe 
sonicator. The mixture is centrifuged and the protein concentration of the 
supernatant determined. One ^ig of protein from the soluble fraction of the culture 
can be separated by SDS-polyacrylamide gel electrophoresis. Gels can be 
observed for protein bands migrating at the expected molecular weight. 

Expression of Maize Phvtvl/prenvltransferase in Soybean Somatic Embryos 

The ability to change the levels of total tocopherol levels in plants by 
transforming them with sequences encoding the maize phytyl/prenyltransferase 
was tested by preparing transgenic soybean somatic embryos and assaying the 
tocopherol and oil levels. Plasmid DNA from clone poo18chste82r was used as a 
template for the amplification of the open reading from per by using the following 
two primers AGC GCG GCC GCA TGG ACG CGC TTC GCC TAC GGC 
CGT(forward primer) and AGC GCG GCC GCT CAC CGC ACC AGA GGG ATG 
AGC AG(reverse primer). Pfu polymerase was used according to the 
manufacturers recommendations (Stratagene). The following per reaction mix 
contained the following: 5ng plasmid, 25nmoles dNTPs, 5% DMSO, 1x per buffer 
(supplied), 30nmoles primers, 5U pfu polymerase in 100ul reaction volume. The 
per reaction conditions were as follows: Step 1, 45s 94°C; step 2 25 cycles of 45s 
94°C, 45s 58°c annealing, 2min extension 72°C. Step 3 72°C 10min, step 4 0°C. 
The per product was purified by agarose gel electrophoresis (!% agarose in TAE), 
the ethidium bromide visualized band cut out and purified from the gel by using a 
QIAquick Gel Extraction kit (Qiagen) according to the manufacturers 
recommendations. The purified per product (200ng) was ligated into the srfl site of 
the plasmid PCR-Script cloning vector and the resultant plasmid was used to 
transform E.coli DH10 cells. Colonies containing the 1.2kb Notl fragment were 
identified by antibiotic (ampicillin selection) and blue / white (IPTG + X-gal) 
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selection of colonies on LB/Amp plates. White (recombinant) colonies were picked 
and grown overnight on liquid LB/Amp culture. Positive clones were identified by 
plasmid preparation and restriction digest analysis for the presence of the 1.2kB 
Noti fragment. Positive clones were used as template to fully sequence the phytyl 
5 transferase orf (both strands). Plasmids containing the correct insert verified by 
nucleic acid sequence were digested with Noti and the 1 .2kb fragment ligated to 
Afofl-digested and phosphatase-treated pKS67. The plasmid pKS67 was prepared 
by replacing in pRB20 (described in U.S. Patent No. 5,846784) the 800 bp Nos 3' 
fragment, with the 285 bp Nos 3' fragment containing the polyadenylation signal 
10 sequence and described in Depicker et al. (1982) J. Mol. Appl. Genet 7:561-573. 
Clones were screened for the sense and antisense orientation of the 
phytyl/prenyltransferase insert fragment by restriction enzyme digestion. 

Transformation of Soybean Somatic Embryo Cultures 

is The stock solutions and media shown in Table 4 were used for 

transformation and propagation of soybean somatic embryos: 
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Stock Solutions 


MS Sulfate 100x stock 


(QfU 


MgS0 4 .7H 2 0 


37.0 


MnS0 4 .H 2 0 


1.69 


ZnS0 4 .7H 2 0 


0.86 


CuS0 4 .5H 2 0 


0.0025 


MS Halides 100x stock 


CaCI 2 .2H 2 0 


44.0 


Kl 


0.083 


CoCI 2 .6H 2 0 


0.00125 


KH 2 P0 4 ' 


17.0 


H3BO3 


0.62 


Na 2 Mo0 4 .2H 2 0 


0.025 


Na 2 EDTA 


3.724 


FeS0 4 .7H 2 0 


2.784 


B5 Vitamin stock 


myo-inositol 


100.0 


nicotinic acid 


1.0 


pyridoxine HCI 


1.0 


thiamine 


10.0 



Media 

SB55 (per Liter) 



10 mL of each MS stock 



1 mL of B5 Vitamin stock 



0.8 g NH 4 NQ 3 



3.033 g KNO3 



1 mL 2,4-D (10 mg/mL stock) 



0.667 g asparagine 



pH 5.7 



SB103 (per Liter) 
1 pk. Murashige & Skoog salt mixture* 



60 g maltose 



2 g gelrite 



pH 5.7 



SB 148 (per Liter) 
1 pk. Murashige & Skoog salt mixture* 



60 g maltose 



1 mL B5 vitamin stock 



7 g agarose 



pH 5.7 



15 



*(Gibco BRL) 

Soybean embryonic suspension cultures were maintained in 35 mL liquid 
media (SB55) on a rotary shaker (1 50 rpm) at 28°C with a mix of fluorescent and 
incandescent lights providing a 16 h day 8 h night cycle. Cultures were 
subcultured every 2 to 3 weeks by inoculating approximately 35 mg of tissue into 
35 mL of fresh liquid media. 

Soybean embryonic suspension cultures were transformed with the plasmid 
containing the phytyl/prenyltransferase sequence (positive orientation) by the 
method of particle gun bombardment (see Klein et al. (1987) Nature 327:70-73) 
using a DuPont Biolistic PDS1000/He instrument Five pi. of pKS93s plasmid 
DNA (1 g/L), 50 |iL CaCI 2 (2.5 M), and 20 yL spermidine (0.1 M) were added to 
50 u.L of a 60 mg/mL 1 mm gold particle suspension. The particle preparation was 
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agitated for 3 minutes, spun on a microfuge for 10 seconds and the supemate 
removed. The DNA-coated particles were then washed once with 400 of 70% 
ethanol and resuspended in 40 nL of anhydrous ethanol. The DNA/particle 
suspension was sonicated three times for 1 second each. Five of the 
DNA-coated gold particles were then loaded on each macro carrier disk. 

Approximately 300 to 400 mg of two-week-old suspension culture was 
placed in an empty 60 mm X 15 mm petri dish and the residual liquid removed 
from the tissue using a pipette. The tissue was placed about 3.5 inches away 
from the retaining screen and bombarded twice. Membrane rupture pressure was 
set at 1 100 psi and the chamber was evacuated to -28 inches of Hg. Two plates 
were bombarded , and following bombardment, the tissue was divided in half, 
placed back into liquid media, and cultured as described above. 

Fifteen days after bombardment, the liquid media was exchanged with fresh 
SB55 containing 50 mg/mL hygromycin. The selective media was refreshed 
weekly. Six weeks after bombardment, green, transformed tissue was isolated 
and inoculated into flasks to generate new transformed embryonic suspension 
cultures. 

Transformed embryonic clusters were removed from liquid culture media and 
placed on a solid agar media, SB103, containing 0.5% charcoal to begin 
maturation. After 1 week, embryos were transferred to SB 103 media minus 
charcoal. After 5 weeks on SB 103 media, maturing embryos were separated and 
placed onto SB148 media. During maturation embryos were kept at 26°C with a 
mix of fluorescent and incandescent lights providing a 16 h day 8 h night cycle. 
After 3 weeks on SB148 media, embryos were analyzed for the expression of the 
tocopherols. Each embryonic cluster gave rise to 5 to 20 somatic embryos. 

Non-transformed somatic embryos were cultured by the same method as 
used for the transformed somatic embryos. 

r 

Analysis of Transformed Somatic Embryos 

At the end of 3 weeks on SB148 medium somatic embryos were harvested 
from 33 independently transformed lines. Pools of five embryos/event were 
pooled, the fresh weight noted, the embryos frozen on dry ice and lyophilized 
overnight. The corresponding dry weight was noted, the embryos pulverized with a 
glass rod and tocopherols and oil extracted by the addition of 0.5ml heptane (18h, 
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room temperature, dark). The embryos were re-extracted with 0.25ml of heptane 
the solutions pooled and centrifuged (5min, 12000g). The supernatant was stored 
in amber hplc autosampler vials at -20°c prior to analysis. 

HPLC analysis of the extracts was carried out using an HP1 100 system 

5 (Agilent Technologies). 25ul of the heptane sample was applied to a Lichrosphere 
Si 60 column (Smicron 4 x 12.5mm). The column was eluted with 
heptane/isopropanol (98:2 v/v) at a flow rate of 1ml/min. After 6minutes all four 
tocopherol isomers were eluted, as detected by a HP1100 fluorescence detector 
(excitation wavelength 295nm, emission wavelength 330nm). Individual tocopherol 

10 standards (Matreya) were diluted with HPLC grade heptane to levels between 1 
and 200ng/ul to construct a six point external standard curve. Tocopherols in each 
sample were quantified using a standard curve run on the same day as the 
samples. 

Total oil content of the samples was estimated by quantitative gas 
15 chromatography of the fatty acid methyl esters. 50ul samples were derivitized by 
addition to 0.5ml of a 1% (v/v) solution of sodium methoxide in methanol, 1 ug of 
undecanoic acid (17:0) dissolved in toluene was added as an internal standard. 
Derivitized fatty acids were extracted in 400ul heptane, fatty acids separated by 
glc and the peak heights quantitated by using a HP 6890 gas liquid 
20 chromatograph equipped with a fused silica capillary column 30m x i.d. 0.25mm 
coated with polar phase Omegawax 320 (Supelco In, Bellfonte,, PA), autosampler, 
flame ionization detector and ChemStation software on a HP 

The example shown in Table 5 shows the data from 33 independent 
transformed lines of somatic soy embryos (five pooled embryos per line) 
25 transformed with KS67 containing the maize phytyl/prenyltransferase in the 
positive orientation. Normal ratios of tocopherol (ngT) / oil (ugOil) in somatic 
embryos are 2-5. Overexpression of the phytyl/prenyltransferase has increased 
the amount of tocopherol relative to oil. In particular in samples 16 and 17 the 
ng/ugOil ratios have doubled to be 10.9 and 10.1 respectively. 
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TABLE 5 - Lines Transformed with KS67 in Positive Orientation 



Sample 


Oil (mg) 


Tocopherol (ng) 


ngT/ugOil 


1 


313.2 


1.25 


• 3.98 


2 


162.5 


0.51 


3.2 


3 


195.9 


1.2S 


6.6 


4 


133.7 


0.69 


5.2 


5 


323.5 


0.95 


2.9 


6 


18.6 


0.13 


7.1 


7 


121.3 


0.32 




8 


98.9 


0.73 


7 A 


9 


175.2 


0.5 


2 8 


10 


314.5 


1.3 




11 


99.4 


0.5 




12 


75.1 


0.23 


o 


13 


105.9 


0.59 


»J.5J 


14 


381.2 


1.15 


o 


15 


248.1 


1 44 


^ fi 


16 


103.8 


1 13 


in q 


17 


165 


1.67 


in 1 


18 


117.3 


0 5 




19 


255.7 


0.77 


o 


20 


365.1 


1 8 


A Q 


21 


253.9 


0 79 


O I 


22 


88.7 


0.59 


fi ft 


23 


454.2 


1 23 


9 7 


24 


352.5 


1.61 


4 ft 


25 


240.9 


0.63 


2 fi 


26 


404.2 


2.19 




27 


323 


1.52 


4.7 


28 


386.2 


2.28 


5.9 


29 


253.5 


1.28 


5 


30 


211.9 


1.35 


6.4 


31 


460.5 


1.3 


2.8 


32 


161.7 


1.19 


7.3 


33 


275.5 


1.66 


6 
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Table 6 - Detailed Analysis of each of the Five Embryos in Transformed Lines 15 
(control), 16, 17 and 18 (control). 



Sample 


Oil (mg) 


Tocopherol (ng) 


ngT/ugOil 


SC5 15-1 


1.36 


10.6 


7.8 


SC5 15-2 


0.75 


7.2 


9.7 


SC5 15-3 


0.93 


5.4 


5.8 


SC5 15-4 


6.67 


37.04 


5.6 


SC5 15-5 


1.35 


10.1 


7.5 


SC5 16-1 


0.8 


15.8 


19.7 


SC5 16-2 


0.4 


10.7 


26.8 


SC5 16-3 


4.21 


27.5 


6.5 


SC5 16-4 


0.2 


3.5 


17.5 


SC516-5 


2.7 


35.7 


13.2 


SC5 17-1 


0.4 


44.3 


111 


SC5 17-2 


0.2 


39.6 


200 


SC5 17-3 


5 


58 


11.7 


SC5 17-4 


1.29 


11.6 


9 


SC5 17-5 


24.7 


197.8 


13.5 


SC5 18-1 


32.1 


43.4 


1.4 


SC5 18-2 


31.6 


6.1 


1.9 


SC5 18-3 


2.99 


11.6 


3.9 


SC5 18-4 


0.7 


6.1 


8.7 


SC5 18-5 


0.5 


3.4 


7 



5 The single embryo analysis in Table 6 was conducted to confirm the pooled 
embryo data provided in Table 5. It should also be noted that an alternative 
embodiment of the invention involves somatic soy embryos transformed with KS67 
containing the maize phytyl/prenyltransferase in the reverse orientation. 

10 The above examples are provided to illustrate the invention but not to limit 

its scope. Other variants of the invention will be readily apparent to one of 
ordinary skill in the art and are encompassed by the appended claims. All 
publications, patents, patent applications, and computer programs cited herein are 
hereby incorporated by reference. 
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An isolated nucleic acid comprising a member selected from the group 
consisting of: 

(a) a polynucleotide that encodes a polypeptide of SEQ ID NO: 4, 12, 
14,16,18,20, 22, 24, 26 or 28; 

(b) a polynucleotide amplified from a plant tissue nucleic acid library using 
the primers of SEQ ID NOS: 5-8, provided the polynucleotide is not 
SEQ ID NO: 9 or the genomic sequence of SEQ ID NO: 1 or 9. 

(c) a polynucleotide comprising at least: 

(i) 280 contiguous bases of SEQ ID NO:1 , 

(ii) 20 contiguous bases of SEQ ID NO: 3, 

(iii) 30 contiguous bases of SEQ ID NO: 3; 

(iv) 50 contiguous bases of SEQ ID NO: 1 1 , 

(v) 50 contiguous bases of SEQ ID NO: 1 3, 

(vi) 297 contiguous bases of SEQ ID NO: 1 5, 

(vii) 20 contiguous bases of the coding region of SEQ ID NO: 17, or 

(viii) 30 contiguous bases of SEQ ID NO: 1 9, 21 , 23, 25, 27 or 29; 

(d) a polynucleotide encoding a plant or bacteria phytyl/prenyltransferase 
protein other than an Arabidopsis thaliana or Synechocystis 
phytyl/prenyltransferase protein; 

(e) a polynucleotide having at least 50% sequence identity to SEQ ID NO: 
3, wherein the % sequence identity is based on the entire coding 
sequence and is determined by BLAST 2.0 using default parameters; 

(f) a polynucleotide having 

(i) at least 70% sequence identity to SEQ ID NO: 3,11,13,17,19, 
23, 25, 27 or 29, 

(ii) at least 70% sequence identity to nucleotides spanning 
positions 226 to 1098 of SEQ ID NO: 15, 

(iii) at least 72% sequence identity to SEQ ID NO: 21 

wherein the % sequence identity is based on the coding sequence and 
is determined by GAP using default parameters; 
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(g) a polynucleotide having at least 90% sequence identity to SEQ ID NO: 
1 wherein the % sequence identity is based on the entire sequence 
and is determined by GAP using default parameters; 

(h) a polynucleotide which selectively hybridizes, under stringent 
hybridization conditions and a wash in 2X SSC at 50°C, to a 
hybridization probe the polynucleotide sequence of which consists of 
SEQ ID NO: 3, 11, 13, 15, 17, 19, 21, 23, 25, 27 or 29, or the 
complement of SEQ ID NO: 3, 11, 13, 15, 17, 19, 21 , 23, 25, 27 or 29, 
provided the polynucleotide is not SEQ ID NO: 9, a genomic sequence 
of SEQ ID NO: 1 or 9, a nucleotide sequence of any length in the 
region between positions 55 to 365 of SEQ ID NO: 15 or a nucleotide 
sequence of any length in the region between positions 801 to 1159 of 
SEQ ID NO: 17; 

(i) a polynucleotide comprising the sequence set forth in SEQ ID NO: 3, 
11, 13, 15, 17,19,21,23, 25, 27 or 29; 

0) a polynucleotide consisting of the sequence set forth in SEQ ID NO: 1 , 
and 

(k) a polynucleotide complementary to a polynucleotide of (a) through Q). 

The isolated nucleic acid of claim 1 wherein the polynucleotide of (c) further 
comprises contiguous nucleotides that encode for the first ten amino acids 
of SEQ ID NO: 4, 12, 14, 16, 18, 20, 22, 24, 26 or 28. 

The isolated nucleic acid of claim 1 wherein the phytyl/prenyltransferase 
polynucleotide of (d) is from maize, soybean, rice, wheat, Arabidopsis 
thaliana or Synechocystis. 

The isolated nucleic acid of claim 1 wherein the polynucleotide of (e) 
modulates a prenyllipid biosynthetic pathway. 

The isolated nucleic acid of claim 4 wherein 2-demethyl-phytylplastoquinol 
or 2-demethyl-plastoquinol-9 is modified. 
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6. The isolated nucleic acid of claim 1 wherein the polynucleotide of (f) 
modulates a prenyllipid biosynthetic pathway. 



7. 



The isolated nucleic acid of claim 6 wherein 2-demethyl-phytylplastoquinol 
or 2-demethyI-plastoquinol-9 is modified. 



The isolated nucleic acid of claim 1 wherein the polynucleotide of (h) 
comprises at least 25 nucleotides in length and hybridizes under stringent 
conditions including a wash with 0.1X SSC at 60°C to a hybridization probe 
the polynucleotide sequence of which consists of SEQ ID NO: 3, 11, 13, 15, 
17, 19, 21, 23, 25, 27 or 29, or the complement of SEQ ID NO: 3, 11, 13, 
15, 17, 19, 21, 23, 25, 27 or 29, provided the polynucleotide is not SEQ ID 
NO: 9, a genomic sequence of SEQ ID NO: 1 or 9, a nucleotide sequence 
of any length in the region between positions 55 to 365 of SEQ ID NO: 15 
or a nucleotide sequence of any length in the region between positions 801 
to1159ofSEQIDNO:17. 



20 



25 



9. The isolated nucleic acid of claim 8 wherein the isolated nucleic acid 
modulates a prenyllipid biosynthetic pathway. 

10. The isolated nucleic acid of claim 9 wherein 2-demethyl-phytylplastoquinol 
or 2-demethyl-plastoquinol-9 is modified. 

11. A vector comprising at least one nucleic acid of claim 1 or SEQ ID NO: 9. 

12. An expression cassette comprising at least one nucleic acid of claim 1 or 
SEQ ID NO: 9 operably linked to a promoter, wherein the nucleic acid is in 
sense or antisense orientation. 



30 13. A host cell into which is introduced with at least one expression cassette of 
claim 12. 

14. The host cell of claim 1 3 that is a plant cell. 
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15. A transgenic plant comprising at least one expression cassette of claim 1 3. 



16. The transgenic plant of claim 15, wherein the plant is corn, soybean, 
sunflower, sorghum, canola, wheat, alfalfa, cotton, rice, barley, millet, 

5 Arabidopsis thaliana, tomato, Brassica, vegetables, peppers, potatoes, 

apples, spinach, or lettuce. 

17. A seed from the transgenic plant of claim 16. 



10 18. The seed of claim 17, wherein the seed is from com, soybean, sunflower, 
sorghum, canola, wheat, alfalfa, cotton, rice, barley, millet, Arabidopsis 
thaliana, tomato, Brassica, vegetables, peppers, potatoes, apples, spinach, 
or lettuce. 



is 19. An isolated protein comprising a member selected from the group 
consisting of: 

(a) a polypeptide comprising at least 25 contiguous amino acids of SEQ 
ID NO: 2, 4, 10, 12, 14, 16, 18, 20, 22, 24, 26 or 28; 

(b) a polypeptide which is a plant or bacterial phytyl/prenyltransferase 
20 protein; 

(c) a polypeptide comprising at least 55% sequence identity to SEQ ID 
NO: 2 or 4, wherein the % sequence identity is based on the entire 
sequence and is determined by BLAST 2.0 using default parameters 
and has at least one epitope in common with a 

25 phytyl/prenyltransferase; 

(d) a polypeptide comprising at least 

(i) 75% sequence identity to SEQ ID NO: 2, 4, 10, 12, 14, 16, 18, 
20, 24, 26 or 28, or 

(ii) 77% sequence identity to SEQ ID NO: 22, 

30 wherein the % sequence identity is based on the entire sequence 

and is determined by GAP using default parameters and has at least 
one epitope in common with a phytyl/prenyltransferase; 

(e) a polypeptide encoded by a nucleic acid of SEQ ID NO: 1, 3, 9, 11, 13, 
15, 17, 19,21,23, 25, 27 or 29; and 
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(f) a polypeptide of SEQ ID NO: 2, 4, 10, 12, 14, 16, 18, 20, 22, 24, 26 or 

28. 

20. The protein of claim 19, wherein the polypeptide is catalytically active. 

21 . A ribonucleic acid sequence encoding the protein of claim 20. 

22. A method for modulating the level of phytyl/prenyltransferase protein in a 
plant, comprising: 

(a) stably transforming a plant cell with a phytyl/prenyltransferase 
polynucleotide operably linked to a promoter, wherein the 
polynucleotide is in sense or antisense orientation; 

(b) growing the plant cell under plant growing conditions to produce a 
regenerated plant capable of expressing the polynucleotide for a 
time sufficient to modulate the level of phytyl/prenyltransferase 
protein in the plant. 

23. The method of claim 22, wherein the phytyl/prenyltransferase 
polynucleotide is selected from those of SEQ ID NO: 1, 3, 9 # 11, 13, 15, 17, 
19,21,23, 25, 27 or 29. 

24. The method of claim 22, wherein the plant is com, soybean, sunflower, 
sorghum, canola, wheat, alfalfa, cotton, rice, barley, millet or Ambidopsis 
thaliana, tomato, Brassica, vegetables, peppers, potatoes, apples, spinach, 
or lettuce. 

25. The method of claim 22, wherein phytyl/prenyltransferase protein is 
increased. 

26. The method of claim 22, wherein phytyl/prenyltransferase protein is 
decreased. 



A method for modulating the level of tocopherol in a plant, comprising: 
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(a) stably transforming a plant cell with a phytyl/prenyltransferase 
polynucleotide operably linked to a promoter, wherein the 
polynucleotide is in sense or antisense orientation; 

(b) growing the plant cell under plant growing conditions to produce a 
regenerated plant capable of expressing the polynucleotide for a 
time sufficient to modulate level of tocopherol in the plant. 

28. The method of claim 27, wherein the phytyl/prenyltransferase 
polynucleotide is selected from SEQ ID NO 1, 3 f 9, 11, 13, 15, 17, 19, 21, 
23, 25, 27, or 29. 

29. A method for modulating the level of plastiquinone in a plant, comprising: 

(a) stably transforming a plant cell with a phytyl/prenyltransferase 
polynucleotide operably linked to a promoter, wherein the 
polynucleotide is in sense or antisense orientation; 

(b) growing the plant cell under plant growing conditions to produce a 
regenerated plant capable of expressing the polynucleotide for a 
time sufficient to modulate the level of plastiquinone in the plant. 

30. The method of claim 29, wherein the phytyl/prenyltransferase 
polynucleotide is selected from SEQ ID NO: 1, 3, 9, 11, 13, 15, 17, 19, 21, 
23, 25, 27 or 29. 
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SEQUENCE LISTING 



<110> Pioneer Hi-Bred International, Inc. & 
Board of Regents of The University and 
Community College System of Nevada on Behalf 
Of The University of Nevada, Reno 

<120> PHYTYL/ PRENYLTRANSFERASE NUCLEIC ACIDS , 
POLYPEPTIDES AND USES THEREOF 



<130> 1095-PCT 

<150> US 09/307,460 
<151> 1999-05-07 



<160> 29 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 1616 
<212> DNA 

<213> Arabidopsis thaliana 

<220> 
<221> CDS 

<222> (108) . . . (1286) 
<400> 1 

gttccttcaa aatcatttct ttctcttctt tgattcccaa agatcacttc tttgtctttg 60 
atttttgatt ttttttctct ctggcgtgaa ggaagaagct ttatttc atg gag tct lie 

Met Glu Ser 
1 

ctg etc tct agt tct tct ctt gtt tec get get ggt ggg ttt tgt tgg 164 
Leu Leu Ser Ser Ser Ser Leu Val Ser Ala Ala Gly Gly Phe Cys Trp 
5 10 is 

aag aag cag aat eta aag etc cac tct tta tea gaa ate cga gtt ctg 212 
Lys Lys Gin Asn Leu Lys Leu His Ser Leu Ser Glu lie Arg Val Leu 
20 25 30 35 

cgt tgt gat teg agt aaa gtt gtc gca aaa ccg aag ttt agg aac aat 260 
Arg Cys Asp Ser Ser Lys Val Val Ala Lys Pro Lys Phe Arg Asn Asn 
40 45 50 

ctt gtt agg cct gat ggt caa gga tct tea ttg ttg ttg tat cca aaa 308 
Leu Val Arg Pro Asp Gly Gin Gly Ser Ser Leu Leu Leu Tyr Pro Lys 
55 60 65 

cat aag teg aga ttt egg gtt aat gee act gcg ggt cag cct gag get 356 
His Lys Ser Arg Phe Arg Val Asn Ala Thr Ala Gly Gin Pro Glu Ala 
7 ° 75 Qo 

ttc gac teg aat age aaa cag aag tct ttt aga gac teg tta gat gcg 404 
Phe Asp ser Asn Ser Lys Gin Lys Ser Phe Arg Asp Ser Leu Asp Ala 
.85 90 95 
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ttt tac agg ttt tct agg cct cat aca gtt att ggc aca gtg ctt age 452 
Phe Tyr Arg Phe Ser Arg Pro His Thr Val He Gly Thr Val Leu Ser 
100 105 110 115 



att tta tct gta tct ttc tta gca gca gag aag gtt tct gat ata tct 500 
He Leu Ser Val Ser Phe Leu Ala Ala Glu Lys Val Ser Asp He Ser 
120 125 130 



cct tta ctt ttc act ggc ate ttg gag get gtt gtt gca get etc atg 548 
Pro Leu Leu Phe Thr Gly He Leu Glu Ala Val Val Ala Ala Leu Met 
135 140 145 

atg aac att tac ata gtt ggg eta aat cag ttg tct gat gtt gaa ata 596 
Met Asn He Tyr He Val Gly Leu Asn Gin Leu Ser Asp Val Glu He 
150 155 160 

gat aag gtt aac aag ccc tat ctt cca ttg gca tea gga gaa tat tct 644 
Asp Lys Val Asn Lys Pro Tyr Leu Pro Leu Ala Ser Gly Glu Tyr Ser 
165 170 175 

gtt aac ace ggc att gca ata gta get tec ttc tec ate atg agt ttc 692 
Val Asn Thr Gly He Ala He Val Ala Ser Phe Ser He Met Ser Phe 
180 185 190 195 



tgg ctt ggg tgg att gtt ggt tea tgg cca ttg ttc tgg get ctt ttt 740 
Trp Leu Gly Trp He Val Gly Ser Trp Pro Leu Phe Trp Ala Leu Phe 
200 205 210 



gtg agt ttc atg etc ggt act gca tac tct ate aat ttg cca ctt tta 788 
Val Ser Phe Met Leu Gly Thr Ala Tyr Ser He Asn Leu Pro Leu Leu 
215 220 225 



egg tgg aaa aga ttt gca 
Arg Trp Lys Arg Phe Ala 
230 

cga get att att gtt caa 
Arg Ala He He Val Gin 
245 

gtg ttt gga aga cca ate 
Val Phe Gly Arg Pro He 
260 265 



ttg gtt gca gca atg tgt 
Leu Val Ala Ala Met Cys 
235 

ate gec ttt tat eta cat 
He Ala Phe Tyr Leu His 
250 255 

ttg ttc act agg cct ctt 
Leu Phe Thr Arg Pro Leu 
270 



ate etc get gtc 836 

He Leu Ala Val 

240 

att cag aca cat 884 
He Gin Thr His 



att ttc gec act 932 
lie Phe Ala Thr 
275 



gcg ttt atg age ttt ttc tct gtc gtt att gca ttg ttt aag gat ata 980 
Ala Phe Met Ser Phe Phe Ser Val Val He Ala Leu Phe Lys Asp He 
280 285 290 



cct gat ate gaa ggg gat aag ata ttc gga ate cga tea ttc tct gta 1028 

Pro Asp He Glu Gly Asp Lys He Phe Gly He Arg Ser Phe Ser Val 
295 300 305 

act ctg ggt cag aaa egg gtg ttt tgg aca tgt gtt aca eta ctt caa 1076 

Thr Leu Gly Gin Lys Arg Val Phe Trp Thr Cys Val Thr Leu Leu Gin 
310 315 320 



atg get tac get gtt gca att eta gtt 
Met Ala Tyr Ala Val Ala He Leu Val 
325 330 

tgg age aaa gtc ate teg gtt gtg ggt 
Trp Ser Lys Val He Ser Val Val Gly 



gga gec aca tct cca ttc ata 1124 
Gly Ala Thr Ser Pro Phe He 
335 

cat gtt ata etc gca aca act 1172 
His Val He Leu Ala Thr Thr 
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340 



345 



350 



c ... 



355 



ttg tgg get cga get aag tec gtt gat ctg agt age aaa ace gaa ata 
Leu Trp Ala Arg Ala Lys Ser Val Asp Leu Ser Ser Lys Thr Glu He 
360 365 370 



1220 



act tea tgt tat atg ttc ata tgg aag etc ttt tat gca gag tac ttg 
Thr Ser Cys Tyr Met Phe He Trp Lys Leu Phe Tyr Ala Glu Tyr Leu 
375 380 385 



1268 



ctg tta cct ttt ttg aag tgactgacat tagaagagaa gaagatggag 
Leu Leu Pro Phe Leu Lys 
390 



1316 



ataaaagaat aagtcatcac tatgettctg tttttattac aagttcatga aattaggtag 1376 

tgaactagtg aattagagtt ttattctgaa acatggcaga ctgcaaaaat atgtcaaaga 1436 

tatgaatttc tgttgggtaa agaagtctct gcttgggcaa aatcttaagg ttcggtgtgt 1496 

tgatataatg etaagegaag aaatcgattc tatgtagaaa tttccgaaac tatgtgtaaa 1556 

catgtcagaa catctccatt ctatatcttc ttctgeaaga aagctctgtt tttatcacct 1616 

<210> 2 
<211> 393 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 2 

Met Glu Ser Leu Leu Ser Ser Ser Ser Leu Val Ser Ala Ala Gly Gly 

1 5 io 15 

Phe Cys Trp Lys Lys Gin Asn Leu Lys Leu His Ser Leu Ser Glu He 

20 25 30 

Arg Val Leu Arg Cys Asp Ser Ser Lys Val Val Ala Lys Pro Lys Phe 

35 40 45 

Arg Asn Asn Leu Val Arg Pro Asp Gly Gin Gly Ser Ser Leu Leu Leu 

50 55 60 

Tyr Pro Lys His Lys Ser Arg Phe Arg Val Asn Ala Thr Ala Gly Gin 
65 70 75 80 

Pro Glu Ala Phe Asp Ser Asn Ser Lys Gin Lys Ser Phe Arg Asp Ser 

85 90 95 

Leu Asp Ala Phe Tyr Arg Phe Ser Arg Pro His Thr Val He Gly Thr 

100 105 no 

Val Leu Ser He Leu Ser Val Ser Phe Leu Ala Ala Glu Lys Val Ser 

115 120 125 

Asp He Ser Pro Leu Leu Phe Thr Gly He Leu Glu Ala Val Val Ala 

130 135 140 

Ala Leu Met Met Asn He Tyr He Val Gly Leu Asn Gin Leu Ser Asp 
145 150 155 160 

Val Glu He Asp Lys Val Asn Lys Pro Tyr Leu Pro Leu Ala Ser Gly 

165 170 175 

Glu Tyr Ser Val Asn Thr Gly He Ala He Val Ala Ser Phe Ser He 

180 185 190 

Met Ser Phe Trp Leu Gly Trp He Val Gly Ser Trp Pro Leu Phe Trp 

195 200 205 

Ala Leu Phe Val Ser Phe Met Leu Gly Thr Ala Tyr Ser He Asn Leu 

210 215 220 

Pro Leu Leu Arg Trp Lys Arg Phe . Ala Leu Val Ala Ala Met Cys He 
225 230 235 2 40 

Leu Ala Val Arg Ala He He Val Gin He Ala Phe Tyr Leu His He 

245 250 255 

Gin Thr His Val Phe Gly Arg Pro He Leu Phe Thr Arg Pro Leu He 

260 265 270 

Phe Ala Thr Ala Phe Met Ser Phe Phe Ser Val Val He Ala Leu Phe 
275 280 285 
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Lys 


Asp 


He 


Pro 


Asp He 


Glu 


Glv 


Asp 




Tip 

J. ACS 




uiy 


T1a 
11C 


Arg 


oer 




290 








295 










300 








Phe 


Ser 


Val 


Thr 


Leu Gly 


Gin 


Lvs 


Ara 


Val 


Phe 


Trn 
iL P 


Thr 


cys 


vai 


inr 


305 








310 










315 










ion 


Leu 


Leu 


Gin 


Met 


Ala Tyr 


Ala 


Val 


Ala 


He 


Leu 


Val 
v ai 


uiy 




inr 


oer 










325 








330 










J J J 




Pro 


Phe 


He 


Tm 


Ser Lys 


Val 


Tip 
lie 


Ser 


vai 


vai 


tjiy 


HIS 


vai 


He 


Leu 








340 








345 








350 






Ala 


Thr 


Thr 


Leu 


Trp Ala 


Arg 


Ala 


Lys 


Ser 


Val 


Asp 


Leu 


Ser 


Ser 


Lys 






355 








360 










365 






Thr 


Glu 


He 


Thr 


Ser Cys 


Tyr 


Met 


Phe 


lie 


Trp 


Lys 


Leu 


Phe 


Tyr 


Ala 




370 








375 










380 








Glu 


Tyr 


Leu 


Leu 


Leu Pro 


Phe 


Leu 


Lys 
















385 








3 90 





















<210> 3 
<211> 1540 
<212> DNA 
<213> 2ea mays 

<220> 
<221> CDS 

<222> (21) . . . (1217) 

<221> misc_feature 
<222> (1) . . . (1540) 
<223> n = A,T,C or G 

<400> 3 

cgcgttcgcc eggecaaggg atg gac gcg ctt cgc eta egg ccg tec etc etc 53 

Met Asp Ala Leu Arg Leu Arg Pro Ser Leu Leu 
15 io 

ccc gtg egg ccc ggc gcg gec cgc ccg cga gat cat ttt eta cca cca 101 
Pro Val Arg Pro Gly Ala Ala Arg Pro Arg Asp His Phe Leu Pro Pro 
15 20 25 

tgt tgt tec ata caa cga aat ggt gaa gga cga att tgc ttt tct age 149 
Cys Cys Ser He Gin Arg Asn Gly Glu Gly Arg He Cys Phe Ser Ser 
30 35 40 

caa agg acc caa ggt cct acc ttg cat cac cat cag aaa ttc ttc gaa 197 
Gin Arg Thr Gin Gly Pro Thr Leu His His His Gin Lys Phe Phe Glu 
45 50 55 

tgg aaa tec tec tat tgt agg ata tea cat egg tea tta aat act tct 245 
Trp Lys Ser Ser Tyr Cys Arg He Ser His Arg Ser Leu Asn Thr Ser 
60 65 70 75 

gtt aat get teg ggg caa cag ctg cag tct gaa cct gaa aca cat gat 293 
Val Asn Ala Ser Gly Gin Gin Leu Gin Ser Glu Pro Glu Thr His Asp 
80 85 90 

tct aca acc ate tgg agg gca ata tea tct tct eta gat gca ttt tae 341 
Ser Thr Thr He Trp Arg Ala He Ser Ser Ser Leu Asp Ala Phe Tyr 
95 100 105 

aga ttt tec egg cca cat act gtc ata gga aca gca tta age ata gtc 389 
Arg Phe Ser Arg Pro His Thr Val He Gly Thr Ala Leu Ser He Val 
HO US 120 



tea gtt tec ctt eta get gtc cag age ttg tct gat ata tea cct ttg 



437 
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Ser Val Ser Leu Leu Ala Val Gin Ser Leu Ser Asp lie Ser Pro Leu 
125 130 135 

ttc etc act ggt ttg ctg gag gca gtg gta get gec ctt ttc atg aat 4 65 

Phe Leu Thr Gly Leu Leu Glu Ala Val Val Ala Ala Leu Phe Met Asn 
140 145 150 155 

ate tat att gtt gga ctg aac cag tta ttc gac att gag ata gac aag 533 
lie Tyr He Val Gly Leu Asn Gin Leu Phe Asp He Glu He Asp Lys 
160 165 170 

gtt aac aag cca act ctt cca ttg gca tct ggg gaa tac acc ctt gca 581 
Val Asn Lys Pro Thr Leu Pro Leu Ala Ser Gly Glu Tyr Thr Leu Ala 
175 180 185 

act ggg gtt gca ata gtt teg gtc ttt gec get atg age ttt ggc ctt 629 
Thr Gly Val Ala He Val Ser Val Phe Ala Ala Met Ser Phe Gly Leu 
!90 195 200 

gga tgg get gtt gga tea caa cct ctg ttt tgg get ctt ttc ata age 677 
Gly Trp Ala Val Gly Ser Gin Pro Leu Phe Trp Ala Leu Phe He Ser 
205 210 215 

ttt gtt ctt ggg act gca tat tea ate aat ctg ccg tac ctt cga tgg 725 
Phe Val Leu Gly Thr Ala Tyr Ser He Asn Leu Pro Tyr Leu Arq Trp 
220 225 230 235 

aag aga ttt get gtt gtt gca gca ctg tgc ata tta gca gtt cgt gca 773 
Lys Arg Phe Ala Val Val Ala Ala Leu Cys He Leu Ala Val Arg Ala 
2 *° 245 250 

gtg att gtt cag ctg gee ttt ttt etc cac, att cag act ttt gtt ttc 821 
Val He Val Gin Leu Ala Phe Phe Leu His He Gin Thr Phe Val Phe 
255 260 265 

agg aga ccg gca gtg ttt tct agg cca tta tta ttt gca act gga ttt 869 
Arg Arg Pro Ala Val Phe Ser Arg Pro Leu Leu Phe Ala Thr Gly Phe 
2 ™ 275 280 

atg acg ttc ttc tct gtt gta ata gca eta ttc aag gat ata cct gac 917 
Met Thr Phe Phe Ser Val Val He Ala Leu Phe Lys Asp He Pro Asp 
285 290 295 

ate gaa ggg gac cgc ata ttc ggg ate cga tec ttc age gtc egg tta 965 
He Glu Gly Asp Arg He Phe Gly He Arg Ser Phe Ser Val Arg Leu 
300 305 310 315 

ggg caa aag aag gtc ttt tgg ate tgc gtt ggc ttg ctt gag atg gee 1013 
Gly Gin Lys Lys Val Phe Trp He Cys Val Gly Leu Leu Glu Met Ala 
320 325 336 

tac age gtt gcg ata ctg atg gga get acc tct tec tgt ttg tgg age 1061 
Tyr Ser Val Ala He Leu Met Gly Ala Thr Ser Ser Cys Leu Trp Ser 
335 340 345 

aaa aca gca acc ate get ggc cat tec ata ctt gee gcg ate eta tgg H09 
Lys Thr Ala Thr He Ala Gly His Ser He Leu Ala Ala He Leu Trp 
35 0 355 360 

age tgc gcg cga teg gtg gac ttg acg age aaa gee gca ata acg tec 1157 
Ser Cys Ala Arg Ser Val Asp Leu Thr Ser Lys Ala Ala He Thr Ser 
365 370 375 
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ttc tac atg ttc ate tgg aag ctg ttc tac gcg gag tac ctg etc ate 1205 
Phe Tyr Met Phe lie Trp Lys Leu Phe Tyr Ala Glu Tyr Leu Leu lie 
380 385 390 395 

cct ctg gtg egg tgagegegag gcgaggtggt ggcagacgga teggegtegg 1257 
Pro Leu Val Arg 



eggggeggea aacaactcca egggagaact tgagtgccgg aagtaaactc ccgtttgaaa 1317 

gttgaagcgt gcaccaccgg caccgggcag agagagacac ggtggctgga tggatacgga 1377 

tggccccccc aataaattcc cccgtgcatg gtaccccacg ctgcttgatg atatcccatg 1437 

tgtccgggtg accggacctg ategtctcta aanagattgg ttgcaaaaaa aaaaaaaaaa 1497 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aag 1540 

<210> 4 
<211> 399 
<212> PRT 
<213> Zea mays 

<400> 4 

Met Asp Ala Leu Arg Leu Arg Pro Ser Leu Leu Pro Val Arg Pro Gly 

15 10 15 

Ala Ala Arg Pro Arg Asp His Phe Leu Pro Pro Cys Cys Ser He Gin 

20 25 30 

Arg Asn Gly Glu Gly Arg He Cys Phe Ser Ser Gin Arg Thr Gin Gly 

35 40 45 

Pro Thr Leu His His His Gin Lys Phe Phe Glu Trp Lys Ser Ser Tyr 

50 55 60 

Cys Arg He Ser His Arg Ser Leu Asn Thr Ser Val Asn Ala Ser Gly 
65 70 75 80 

Gin Gin Leu Gin Ser Glu Pro Glu Thr His Asp Ser Thr Thr He Trp 

85 90 95 

Arg Ala He Ser Ser Ser Leu Asp Ala Phe Tyr Arg Phe Ser Arg Pro 

100 105 x HO 

His Thr Val He Gly Thr Ala Leu Ser He Val Ser Val Ser Leu Leu 

115 120 125 

Ala Val Gin Ser Leu Ser Asp He Ser Pro Leu Phe Leu Thr Gly Leu 

130 135 140 

Leu Glu Ala Val Val Ala Ala Leu Phe Met Asn He Tyr He Val Gly 
145 150 155 160 

Leu Asn Gin Leu Phe Asp He Glu He Asp Lys Val Asn Lys Pro Thr 

165 170 175 

Leu Pro Leu Ala Ser Gly Glu Tyr Thr Leu Ala Thr Gly Val Ala He 

180 185 190 

Val Ser Val Phe Ala Ala Met Ser Phe Gly Leu Gly Trp Ala Val Gly 

195 200 205 

Ser Gin Pro Leu Phe Trp Ala Leu Phe He Ser Phe Val Leu Gly Thr 

210 215 220 

Ala Tyr Ser He Asn Leu Pro Tyr Leu Arg Trp Lys Arg Phe Ala Val 
225 230 235 240 

Val Ala Ala Leu Cys He Leu Ala Val Arg Ala Val lie Val Gin Leu 

245 250 2S5 

Ala Phe Phe Leu His He Gin Thr Phe Val Phe Arg Arg Pro Ala Val 

260 265 270 

Phe Ser Arg Pro Leu Leu Phe Ala Thr Gly Phe Met Thr Phe Phe Ser 

275 280 285 

Val Val He Ala Leu Phe Lys Asp He Pro Asp He Glu Gly Asp Arg 

290 295 300 

He Phe Gly He Arg Ser Phe Ser Val Arg Leu Gly Gin Lys Lys Val 
305 310 315 320 

Phe Trp He Cys Val Gly Leu Leu Glu Met Ala Tyr Ser Val Ala He 
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Leu 


Met 


Gly 


Ala 


Thr 


Ser 


Ser Cys 


Leu 








340 








345 


Ala 


Gly His 


Ser 


lie 


Leu 


Ala Ala 


lie 






355 








360 




Val 


Asp 


Leu 


Thr 


Ser 


Lys 


Ala Ala 


lie 




370 










375 




Trp 


Lys 


Leu 


Phe 


Tyr 


Ala 


Glu Tyr 


Leu 


385 










390 







<210> 5 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Arabidopsis thaliana 
<400> 5 

ttgttttcag gctgttgttg cagctctc 

<210> 6 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Arabidopsis thaliana 
<400> 6 

cgtttctgac ccagagttac agagaatg 

<210> 7 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synechocystis 
<400> 7 

tattcatatg gcaactatcc aagctttttg 

<210> 8 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synechocystis 
<400> 8 

ggatcctaat tgaagaagat actaaatagt tc 

<210> 9 
<211> 927 
<212> DNA 

<213> Synechocystis 

<220> 

<221> CDS 

<222> (1)...(927) 



PCT/US00/1I439 



330 335 
Trp Ser Lys Thr Ala Thr He 
350 

Leu Trp Ser Cys Ala Arg Ser 
365 

Thr Ser Phe Tyr Met Phe He 
380 

Leu He Pro Leu Val Arg 
395 
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<223> 
<400> 9 

atg gca act ate caa get ttt tgg cgc ttc tec cgc ccc cat acc ate 4 8 

Met Ala Thr lie Gin Ala Phe Trp Arg Phe Ser Arg Pro His Thr He 
15 10 15 

att ggt aca act ctg age gtc tgg get gtg tat ctg tta act att etc 96 

He Gly Thr Thr Leu Ser Val Trp Ala Val Tyr Leu Leu Thr He Leu 
20 25 30 

ggg gat gga aac tea gtt aac tec cct get tec ctg gat tta gtg ttc 144 

Gly Asp Gly Asn Ser Val Asn Ser Pro Ala Ser Leu Asp Leu Val Phe 
35 40 45 

ggc get tgg ctg gee tgc ctg ttg ggt aat gtg tac att gtc ggc etc 192 

Gly Ala Trp Leu Ala Cys Leu Leu Gly Asn Val Tyr He Val Gly Leu 

50 55 60 

aac caa ttg tgg gat gtg gac att gac cgc ate aat aag ccg aat ttg 240 

Asn Gin Leu Trp Asp Val Asp He Asp Arg He Asn Lys Pro Asn Leu 
65 70 75 80 

ccc eta get aac gga gat ttt tct ate gee cag ggc cgt tgg att gtg 2 88 

Pro Leu Ala Asn Gly Asp Phe Ser lie Ala Gin Gly Arg Trp He Val 
85 90 95 

gga ctt tgt ggc gtt get tec ttg gcg ate gee tgg gga tta ggg eta 33 6 

Gly Leu Cys Gly Val Ala Ser Leu Ala He Ala Trp Gly Leu Gly Leu 
100 105 110 

tgg ctg ggg eta acg gtg ggc att agt ttg att att ggc acg gec tat 3 84 

Trp Leu Gly Leu Thr Val Gly He Ser Leu He He Gly Thr Ala Tyr 

115 120 125 

teg gtg ccg cca gtg agg tta aag cgc ttt tec ctg ctg gcg gee ctg 432 

Ser Val Pro Pro Val Arg Leu Lys Arg Phe Ser Leu Leu Ala Ala Leu 

130 135 140 

tgt att ctg acg gtg egg gga att gtg gtt aac ttg ggc tta ttt tta 480 

Cys lie Leu Thr Val Arg Gly lie Val Val Asn Leu Gly Leu Phe Leu 
145 150 155 160 

ttt ttt aga att ggt tta ggt tat ccc ccc act tta ata acc ccc ate 528 

Phe Phe Arg lie Gly Leu Gly Tyr Pro Pro Thr Leu lie Thr Pro He 
165 170 175 

tgg gtt ttg act tta ttt ate tta gtt ttc acc gtg gcg ate gec att 576 

Trp Val Leu Thr Leu Phe lie Leu Val Phe Thr Val Ala He Ala He 
180 185 190 

ttt aaa gat gtg cca gat atg gaa ggc gat egg caa ttt aag att caa 624 

Phe Lys Asp Val Pro Asp Met Glu Gly Asp Arg Gin Phe Lys He Gin 

195 200 205 

act tta act ttg caa ate ggc aaa caa aac gtt ttt egg gga acc tta 672 

Thr Leu Thr Leu Gin lie Gly Lys Gin Asn Val Phe Arg Gly Thr Leu 

210 215 220 



att tta etc act ggt tgt tat tta gee atg gca ate tgg ggc tta tgg 
lie Leu Leu Thr Gly Cys Tyr Leu Ala Met Ala lie Trp Gly Leu Trp 
225 230 235 240 



720 



WO 00/68393 



9 



PCT/USOO/11439 



gcg get atg cct tta aat act get ttc ttg att gtt tec cat ttg tgc 768 
Ala Ala Met Pro Leu Asn Thr Ala Phe Leu He Val Ser His Leu Cys 
245 250 255 

tta tta gec tta etc tgg tgg egg agt cga gat gta cac tta gaa age 816 
Leu Leu Ala. Leu Leu Trp Trp Arg Ser Arg Asp Val His Leu Glu Ser 
260 265 270 

aaa acc gaa att get agt ttt tat cag ttt att tgg aag eta ttt ttc 864 
Lys Thr Glu lie Ala Ser Phe Tyr Gin Phe He Trp Lys Leu Phe Phe 
275 280 285 

tta gag tac ttg ctg tat ccc ttg get ctg tgg tta cct aat ttt tct 912 
Leu Glu Tyr Leu Leu Tyr Pro Leu Ala Leu Trp Leu Pro Asn Phe Ser 
290 295 300 

aat act att ttt tag 927 

Asn Thr He Phe * 

305 



<210> 10 

<211> 308 

<212> PRT 

<213> Synechocystis 

<400> 10 



Met 


Ala 


Thr 


He 


Gin 


Ala 


Phe 


Trp 


Arg 


Phe Ser Arg Pro His 


Thr 


He 


1 








5 










10 


15 




He 


Gly 


Thr 


Thr 


Leu 


Ser 


Val 


Trp 


Ala 


Val Tyr Leu Leu Thr 


He 


Leu 








20 










25 


30 






Gly 


Asp 


Gly 


Asn 


Ser 


Val 


Asn 


Ser 


Pro 


Ala Ser Leu Asp Leu 


Val 


Phe 






35 










40 




45 






Gly 


Ala 


Trp 


Leu 


Ala 


Cys 


Leu 


Leu 


Gly Asn Val Tyr He Val 


Gly Leu 




50 










55 






60 






Asn 


Gin 


Leu 


Trp 


Asp 


Val 


Asp 


He 


Asp Arg lie Asn Lys Pro 


Asn 


Leu 


65 










70 








75 




80 


Pro 


Leu 


Ala 


Asn 


Gly 


Asp 


Phe 


Ser 


He 


Ala Gin Gly Arg Trp 


He 


Val 










85 










90 


95 




Gly 


Leu 


Cys 


Gly 


Val 


Ala 


Ser 


Leu 


Ala 


He Ala Trp Gly Leu 


Gly Leu 








100 










105 


110 






Trp 


Leu 


Gly 


Leu 


Thr 


Val 


Gly 


He 


Ser 


Leu He He Gly Thr 


Ala 


Tyr 






115 










120 




125 




Ser 


Val 


Pro 


Pro 


Val 


Arg 


Leu 


Lys 


Arg 


Phe Ser Leu Leu Ala 


Ala 


Leu 




130 










135 






140 






Cys 


He 


Leu 


Thr 


Val 


Arg 


Gly 


He 


Val 


Val Asn Leu Gly Leu 


Phe 


Leu 


145 










150 








155 




160 


Phe 


Phe 


Arg 


He 


Gly 


Leu 


Gly Tyr 


Pro 


Pro Thr Leu He Thr 


Pro 


He 










165 










170 


175 




Trp 


Val 


Leu 


Thr 


Leu 


Phe 


He 


Leu 


Val 


Phe Thr Val Ala He 


Ala 


He 








180 










185 


190 






Phe 


Lys 


Asp 


Val 


Pro 


Asp 


Met 


Glu Gly Asp Arg Gin Phe Lys 


He 


Gin 






195 










200 




205 






Thr 


Leu 


Thr 


Leu 


Gin 


He 


Gly Lys Gin Asn Val Phe Arg Gly 


Thr 


Leu 




210 










215 






220 






He 


Leu 


Leu 


Thr 


Gly 


Cys 


Tyr 


Leu 


Ala 


Met Ala He Trp Gly 


Leu 


Trp 


225 










230 








235 




240 


Ala 


Ala 


Met 


Pro 


Leu 


Asn 


Thr 


Ala 


Phe 


Leu He Val Ser His 


Leu 


Cys 










245 










250 


255 


Leu 


Leu 


Ala 


Leu 


Leu 


Trp 


Trp Arg Ser Arg Asp Val His Leu 


Glu 


Ser 








260 










265 


270 
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Lys Thr Glu He Ala Ser Phe Tyr Gin Phe He Trp Lys Leu Phe Phe 

275 280 285 

Leu Glu Tyr Leu Leu Tyr Pro Leu Ala Leu Trp Leu Pro Asn Phe Ser 

290 295 300 

Asn Thr He Phe 
305 

<210> 11 
<211> 1278 
<212> DNA 
<213> Zea mays 

<220> 
<221> CDS 

<222> (106) . . . (879) 
<400> 11 

tcgcaaagac gctgcatgcc ttctatcagt tctgccgacc acacacaata tttggaacca 60 
taataggcat tacttcggtg tctatcctgc cagtgaaaga gcctg gac gat ttt acg 117 

Asp Asp Phe Thr 
1 

ttg ata get ata tgg gga ttt etc gag get ttg gee gee gca tta tgt 165 
Leu He Ala He Trp Gly Phe Leu Glu Ala Leu Ala Ala Ala Leu Cys 
5 10 15 20 

atg aac gtt tat gta gta ggg ctg aac aag gtc aat aag cca acc etc 213 
Met Asn Val Tyr Val Val Gly Leu Asn Lys Val Asn Lys Pro Thr Leu 
25 30 35 

cca tta teg ttc gga gag ttt tea atg cca act gca gta ttg tta gta 261 
Pro Leu Ser Phe Gly Glu Phe Ser Met Pro Thr Ala Val Leu Leu Val 
40 45 50 

gtg gca ttc ttg gtc atg age att age ate gga ata aga tea aag tet 309 
Val Ala Phe Leu Val Met Ser He Ser He Gly He Arg Ser Lys Ser 
55 60 65 

get cca ttg atg tgt get ttg ctt gtt tgc ttc ctt ctt gga age gca 357 
Ala Pro Leu Met Cys Ala Leu Leu Val Cys Phe Leu Leu Gly Ser Ala 
70 75 80 

tac ccc att gac gtc cca tta etc egg tgg aag cga cat get ttt eta 405 
Tyr Pro He Asp Val Pro Leu Leu Arg Trp Lys Arg His Ala Phe Leu 
85 90 95 100 

get gca ttc tgc ata ate ttt gtg agg cct gta gtg gtc cag tta get 453 
Ala Ala Phe Cys He He Phe Val Arg Pro Val Val Val Gin Leu Ala 
105 HO lis 

ttc ttt gca cac atg cag caa cat gtt ctg aag agg ccc ttg gca cct 501 
Phe Phe Ala His Met Gin Gin His Val Leu Lys Arg Pro Leu Ala Pro 
120 125 130 

aca agg teg gtg gtc ttt gca aca tgt ttc atg tgt tgc ttc get gca 549 
Thr Arg Ser Val Val Phe Ala Thr Cys Phe Met Cys Cys Phe Ala Ala 
135 140 145 

gta ata gcg eta ttc aag gat att cct gat gtc gat gga gat aga gat 597 
Val He Ala Leu Phe Lys Asp He Pro Asp Val Asp Gly Asp Arg Asp 
150 155 160 
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ttc ggc att cag tec atg act gta cga tta ggc caa cag aga gtg cat 645 
Phe Gly He Gin Ser Met Thr Val Arg Leu Gly Gin Gin Arg Val His 
165 170 175 180 

agg etc tgc att aat att etc atg aca gca tac gca gec gca att ttg 693 
Arg Leu Cys He Asn He Leu Met Thr Ala Tyr Ala Ala Ala He Leu 
185 190 195 

gta ggc gcg tea tct acg aac ctg tat cag aag att gtc att gtg tct 
Val Gly Ala Ser Ser Thr Asn Leu Tyr Gin Lys He Val He Val Ser 
200 205 210 

ggt cat ggc ttg ctt gec tec aca etc tgg caa aga gca caa caa ttt 
Gly His Gly Leu Leu Ala Ser Thr Leu Trp Gin Arg Ala Gin Gin Phe 
215 220 225 

gac att gag aat aag gat tgt ate aca caa ttt tat atg ttc att tgg 
Asp He Glu Asn Lys Asp Cys He Thr Gin Phe Tyr Met Phe He Trp 
230 235 240 

aag tta ttc tac gee gag tat ttt ctt ata cca ttt gtg tag 
Lys Leu Phe Tyr Ala Glu Tyr Phe Leu He Pro Phe Val * 
245 250 255 

taaagaatca tgegaagaac aacacccctg ctatagacat gtgaaggttt attgetaatg 93 9 

ttactctacc ccctgctata gacatgtgaa ggtttattgc taatgttact ctaccgaatg 999 

gtctgaatgt etatgegtea tttgaatgta atatgactat ttgttgtatc agggtaacaa 1059 

ctggagcaaa tgtaccatgt atattaagca ttaatttaac tgcatcattt gtaccatgta 1119 

tattatgact atgtatgaga tattgtctct tattagtact ggatgtgatg tgtcttatta 1179 

tgactatgga tgagactttt gtgatgtaat tgatgagact atggttttaa atattgttat 123 9 

gtgattgtgt gtgagataaa aaaaaaaaaa aaaaaaaaa 127 8 





<210> 


12 
















<211> 


257 
















<212> 


PRT 
















<213> 


Zea 


mays 












<400> 


12 














Asp 


Asp 


Phe 


Thr 


Leu 


He 


Ala 


He 


Trp Gly Phe 


1 








5 










10 


Ala 


Ala 


Leu 


Cys 


Met 


Asn 


Val 


Tyr Val 


Val Gly 








20 










25 


Lys 


Pro 


Thr 


Leu 


Pro 


Leu 


Ser 


Phe Gly Glu Phe 






35 










40 






Val 


Leu 


Leu 


Val 


Val 


Ala 


Phe 


Leu 


Val 


Met Ser 




50 










55 








Arg 


Ser 


Lys 


Ser 


Ala 


Pro 


Leu 


Met 


Cys 


Ala Leu 


65 










70 






75 


Leu 


Gly 


Ser 


Ala 


Tyr 


Pro 


He 


Asp 


Val 


Pro Leu 










85 










90 


His 


Ala 


Phe 


Leu 


Ala 


Ala 


Phe 


Cys 


He 


He Phe 








100 










105 




Val 


Gin 


Leu 


Ala 


Phe 


Phe 


Ala 


His 


Met 


Gin Gin 






115 










120 






Pro 


Leu 


Ala 


Pro 


Thr 


Arg Ser 


Val 


Val 


Phe Ala 




130 










135 








Cys 


Phe 


Ala 


Ala 


Val 


He 


Ala 


Leu 


Phe 


Lys Asp 


145 










150 








155 


Gly 


Asp 


Arg 


Asp 


Phe 


Gly He 


Gin 


Ser 


Met Thr 










165 










170 


Gin 


Arg 


Val 


His 


Arg 


Leu 


Cys 


He 


Asn 


He Leu 








180 










185 





15 

Leu Asn Lys Val Asn 
30 

Ser Met Pro Thr Ala 
45 

He Ser He Gly He 
60 

Leu Val Cys Phe Leu 
80 

Leu Arg Trp Lys Arg 
95 

Val Arg Pro Val Val 
110 

His Val Leu Lys Arg 
125 

Thr Cys Phe Met Cys 
140 

He Pro Asp Val Asp 
160 

Val Arg Leu Gly Gin 
175 

Met Thr Ala Tyr Ala 
190 



741 



789 



837 



879 
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nla 


r\.±. d 


Tip, 


Leu 


va± 


Gly Ala 


Ser 


Ser 


Thr 


Asn 


Leu 


Tyr Gin 


Lys 


He 






- 195 








200 










205 




val 


He 


Val 


Ser 


Gly 


His Gly 


Leu 


Leu 


Ala 


Ser 


Thr 


Leu Trp 


Gin 


Arg 




210 








215 










220 






Ala 


Gin 


Gin 


Phe 


Asp 


He Glu 


Asn 


Lys 


Asp 


Cys 


He 


Thr Gin 


Phe 


Tyr 


225 










230 








235 








240 


Met 


Phe 


He 


Trp 


Lys 
245 


Leu Phe 


Tyr 


Ala 


Glu 
250 


Tyr 


Phe 


Leu He 


Pro 
255 


Phe 


Val 





























<210> 13 
<211> 1771 
<212> DNA 
<213> Zea mays 





<220> 






























<221> 


CDS 




























<222> 


(1) 


. . . (1149) 






















<400> 


13 


























cca 
Pro 
1 


cgc 
Arg 


gtc 
Val 


egg 
Arg 


gcc 
Ala 
5 


tec 
Ser 


ctt 
Leu 


cct 
Pro 


etc 
Leu 


ccg 
Pro 
10 


ccc 
Pro 


agt 
Ser 


act 
Thr 


gcc 
Ala 


gtc 
Val 
15 


acc 
Thr 


get 
Ala 


cgc 
Arg 


ttc 
Phe 


etc 
Leu 
20 


gcc 
Ala 


gcc 
Ala 


ccc 
Pro 


gcc 
Ala 


ate 
He 
25 


cgc 
Arg 


gtg 
Val 


ate 
He 


age 
Ser 


cca 
Pro 
30 


teg 
Ser 


agg 
Arg 


ccc 
Pro 


gcg 

Ala 


ctg 
Leu 

o c 
35 


ccg 
Pro 


etc 
Leu 


etc 
Leu 


tea 
Ser 


tec 
Ser 
40 


gcc 
Ala 


tec 
Ser 


gca 
Ala 


ggc ggc ttc 
Gly Gly Phe 
45 


cct 
Pro 


cac 
His 


gcc 
Ala 


tct 
Ser 
50 


cgc 
Arg 


get 
Ala 


ccc 
Pro 


tgc 
Cys 


agt 
Ser 
55 


gcc 
Ala 


gcc 
Ala 


cgc 
Arg 


gag 
Glu 


cac 
His 
60 


cgc 
Arg 


cgc 
Arg 


ggc acc 
Gly Thr 


gtg 
Val 
65 


egg 
Arg 


gaa 
Glu 


tgc 
Cys 


tct 
Ser 


cga get 
Arg Ala 
70 


gat get get gga 
Asp Ala Ala Gly 
75 


gca 
Ala 


get 
Ala 


cca 
Pro 


tta 
Leu 


tea 
Ser 
80 


aag 
Lys 


aca 
Thr 


ctg 
Leu 


tta 
Leu 


gac 
Asp 
85 


etc 
Leu 


aag 
Lys 


gat 
Asp 


tec 
Ser 


tgc 
Cys 
90 


tgg 
Trp 


aga 
Arg 


ttt 
Phe 


tta 
Leu 


agg 
Arg 
95 


cca 
Pro 


cat 
His 


aca 
Thr 


ate 
He 


cga 
Arg 
100 


gga 
Gly 


act 
Thr 


get 
Ala 


tta 
Leu 


gga tec 
Gly Ser 
105 


ata 
He 


gca 
Ala 


ttg 
Leu 


gtt 
Val 
110 


gcg 
Ala 


aga 
Arg 


gcc 
Ala 


ttg 
Leu 


ata 
He 
115 


gag 
Glu 


aat 
Asn 


tec 
Ser 


cat 
His 


ctg 
Leu 
120 


ata 
He 


aac 
Asn 


tgg 
Trp 


tgg 
Trp 


ttg 
Leu 
125 


ata 
He 


ttc 
Phe 


aaa 
Lys 


gca 
Ala 


ttc 
Phe 
130 


tat 
Tyr 


gga 

Gly 


ctt 
Leu 


ggg gca 
Gly Ala 
135 


ttg 
Leu 


ata 
He 


ttt 
Phe 


ggc 
Gly 


aat 
Asn 
140 


ggt tac 
Gly Tyr 


ata 
He 


gtt 
Val 


ggg 

Gly 
145 


att 
He 


aat 
Asn 


cag 
Gin 


ate 
He 


tat gat 
Tyr Asp 
150 


gtt 
Val 


get 
Ala 


att gac 
He Asp 
155 


aag 
Lys 


gta 
Val 


aac 
Asn 


aag 
Lys 


cca 
Pro 
160 



4 8 



96 



144 



192 



240 



288 



336 



384 



432 



480 



tat tta ccc att get get ggt gat etc tea att cag tea gca tgg ttg 



528 
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Tyr Leu Pro He Ala Ala Gly Asp Leu Ser He Gin Ser Ala Trp Leu 

165 170 175 



ttg gtg ata tta ttt gca get gca ggt ttt tea att gtt ata tea aac 
Leu Val He Leu Phe Ala Ala Ala Gly Phe Ser He Val He Ser Asn 
180 185 190 



cat get gtc ctt get ggt ggt tta att ttc cag aca tgg gtt ctg gag 

His Ala Val Leu Ala Gly Gly Leu He Phe Gin Thr Trp Val Leu Glu 
340 345 350 

caa gcg aag tac aga aag gat get att teg cag tac tat egg ttc ata 

Gin Ala Lys Tyr Arg Lys Asp Ala He Ser Gin Tyr Tyr Arg Phe He 
355 360 365 

tgg aat etc ttc tat get gaa tat ate ttc ttc ccg tta ata tag 

Trp Asn Leu Phe Tyr Ala Glu Tyr He Phe Phe Pro Leu He * 
370 375 380 



576 



ttt gga cct ttc att ace tct eta tac tgc ctt ggc eta ttt ctt ggc 624 
Phe Gly Pro Phe He Thr Ser Leu Tyr Cys Leu Gly Leu Phe Leu Gly 
195 200 205 

act ata tat tct gtt cct cca ttt aga ctg aag aga tat ccg gtt get 672 
Thr He Tyr Ser Val Pro Pro Phe Arg Leu Lys Arg Tyr Pro Val Ala 
210 215 220 

get ttt ctt ate att gca acg gtt cgt ggt ttc ctt etc aac ttt ggc 720 
Ala Phe Leu He He Ala Thr Val Arg Gly Phe Leu Leu Asn Phe Gly 
225 230 235 240 

gtg tac tat get act agg get gca eta ggt ctt aca ttc caa tgg age 768 
Val Tyr Tyr Ala Thr Arg Ala Ala Leu Gly Leu Thr Phe Gin Trp Ser 
245 250 255 

tec cct gtt get ttc att aca tgc ttc gtg aca eta ttt get ttg gtc 816 
Ser Pro Val Ala Phe He Thr Cys Phe Val Thr Leu Phe Ala Leu Val 
260 265 270 

att get ata ace aaa gat etc cct gat gtt gaa gga gat cgc aag tat 864 
He Ala He Thr Lys Asp Leu Pro Asp Val Glu Gly Asp Arg Lys Tyr 
275 280 285 

caa ata tea act ttg gca aca aag ctt ggt gtc aga aat att gca ttc 912 
Gin He Ser Thr Leu Ala Thr Lys Leu Gly Val Arg Asn He Ala Phe 
290 295 300 

ctt gga tct ggt tta tta tta gca aac tat att get get att get gta 960 
Leu Gly Ser Gly Leu Leu Leu Ala Asn Tyr He Ala Ala He Ala Val 
305 310 315 320 

get ttt acc atg cct cag gat ttc agg tgc act gta atg gtt cct gtg 1008 
Ala Phe Thr Met Pro Gin Asp Phe Arg Cys Thr Val Met Val Pro Val 
325 330 335 



1056 



1104 



1149 



agagatcttg tagttcatct tgatcttggg ctacagccta attcatggga gcaaatgaaa 1209 

agagggagaa gttggcaaag tgaggtctgt tgtgcatatt ttcaacggaa acaatggagt 1269 

agcaatattg etatgetagg gttctgaagt tgtaggagct tttcgaagct tttacgatgt 1329 

tgaaggcgtt gttgttggag ctgtggaagc tgtttttctt tttttccttt tgtatcaaca 1389 

gtgtcgcgtt ctgtacggtc ttacttggaa gtgetttgae ctttgaacac atgggttgaa 1449 

gcttgagatc tggtcccgaa cagatggegg tggaaeggee aagacaagct tgtttcatgc 1509 

cactcgaggt cgaggctaaa ccactacggc gtgctcttcc atgaaacgea gaaaactagg 1569 
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gaaatgacta tatatatggt gcaatacgtt gtatattttc tgagtttcag ctcgtatata 1629 

tagtaggaac ctcaactttt accccatcga ttggaagact gaaacttctt gcatgcgtat 1689 

gtatgcctgt gggtatgtaa aaaccttggc ccgcacaaag ctacatgtta cagaactttc 174 9 

agctcaaaaa aaaaaaaaaa ag 1771 

<210> 14 

<211> 382 

<212> PRT 

<213> Zea mays 



<400> 14 



Pro 


Arg 


Val 


Arg 


Ala 


Ser 


Leu 


Pro 


Leu 


Pro 


Pro 


Ser 


Thr 


Ala 


Val 


Thr 


1 








5 










10 










15 




Ala 


Arg 


Phe 


Leu 


Ala 


Ala 


Pro 


Ala 


lie 


Arg 


Val 


He 


Ser 


Pro 


Ser 


Arg 








20 










25 










30 






Pro 


Ala 


Leu 


Pro 


Leu 


Leu 


Ser 


Ser 


Ala 


Ser 


Ala 


Gly 


Gly 


Phe 


Pro 


His 






35 










40 










45 








Ala 


Ser 


Arg 


Ala 


Pro Cys 


Ser 


Ala 


Ala 


Arg 


Glu 


His 


Arg 


Arg 


Gly 


Thr 




50 










55 










60 










Val 


Arg 


Glu 


Cys 


Ser 


Arg 


Ala 


Asp 


Ala 


Ala 


Gly 


Ala 


Ala 


P.ro 


Leu 


Ser 


65 










70 










75 










80 


Lys 


Thr 


Leu 


Leu 


Asp 


Leu 


Lys 


Asp 


Ser 


Cys 


Trp 


Arg 


Phe 


Leu 


Arg 


Pro 










85 










90 










95 




His 


Thr 


He 


Arg 


Gly Thr 


Ala 


Leu 


Gly 


Ser 


lie 


Ala 


Leu 


Val 


Ala 


Arg 








100 










105 










110 






Ala 


Leu 


He 


Glu 


Asn 


Ser 


His 


Leu 


He 


Asn 


Trp 


Trp 


Leu 


lie 


Phe 


Lys 






115 










120 










125 






Ala 


Phe 


Tyr 


Gly 


Leu Gly 


Ala 


Leu 


He 


Phe 


Gly 


Asn 


Gly 


Tyr 


lie 


Val 




130 










135 










140 










Gly 


He 


Asn 


Gin 


lie 


Tyr 


Asp 


Val 


Ala 


lie 


Asp 


Lys 


Val 


Asn 


Lys 


Pro 


145 










150 










155 








160 


Tyr 


Leu 


Pro 


He 


Ala 


Ala 


Gly 


Asp 


Leu 


Ser 


He 


Gin 


Ser 


Ala 


Trp 


Leu 










165 










170 










175 




Leu 


Val 


He 


Leu 


Phe 


Ala 


Ala 


Ala 


Gly 


Phe 


Ser 


lie 


Val 


lie 


Ser 


Asn 








180 










185 










190 






Phe 


Gly 


Pro 


Phe 


He 


Thr 


Ser 


Leu 


Tyr 


Cys 


Leu 


Gly 


Leu 


Phe 


Leu 


Gly 






195 










200 










205 








Thr 


He 


Tyr 


Ser 


Val 


Pro 


Pro 


Phe 


Arg 


Leu 


Lys 


Arg 


Tyr 


Pro 


Val 


Ala 




210 










215 










220 










Ala 


Phe 


Leu 


He 


He 


Ala 


Thr 


Val 


Arg 


Gly 


Phe 


Leu 


Leu 


Asn 


Phe 


Gly 


225 










230 










235 










240 


Val 


Tyr 


Tyr 


Ala 


Thr 


Arg 


Ala 


Ala 


Leu 


Gly 


Leu 


Thr 


Phe 


Gin 


Trp 


Ser 










245 










250 










255 




Ser 


Pro 


Val 


Ala 


Phe 


He 


Thr 


Cys 


Phe 


Val 


Thr 


Leu 


Phe 


Ala 


Leu 


Val 








260 










265 










270 






He 


Ala 


He 


Thr 


Lys 


Asp 


Leu 


Pro 


Asp 


Val 


Glu 


Gly 


Asp 


Arg 


Lys 


Tyr 






275 










260 










285 








Gin 


He 


Ser 


Thr 


Leu 


Ala 


Thr 


Lys 


Leu 


Gly 


Val 


Arg 


Asn 


lie 


Ala 


Phe 




290 










295 










300 










Leu 


Gly 


Ser 


Gly 


Leu 


Leu 


Leu 


Ala 


Asn 


Tyr 


lie 


Ala 


Ala 


lie 


Ala 


Val 


305 










310 










315 










320 


Ala 


Phe 


Thr 


Met 


Pro 


Gin 


Asp 


Phe 


Arg 


Cys 


Thr 


Val 


Met 


Val 


Pro 


Val 










325 










330 










335 




His 


Ala 


Val 


Leu 


Ala Gly 


Gly 


Leu 


He 


Phe 


Gin 


Thr 


Trp 


Val 


Leu 


Glu 








340 










345 










350 






Gin 


Ala 


Lys 


Tyr 


Arg 


Lys 


Asp 


Ala 


He 


Ser 


Gin 


Tyr 


Tyr 


Arg 


Phe 


lie 






355 










360 










365 








Trp 


Asn 


Leu 


Phe 


Tyr Ala 


Glu 


Tyr 


He 


Phe 


Phe 


Pro 


Leu 


lie 








370 










375 










380 











<210> 15 
<211> 1618 
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<212> DNA 

<213> Oryza sativa 

<220> 
<221> CDS 

<222> (59) . . . (1273) 
<400> 15 

gcacgagctt acaagccgcc gcgcgcgccc ggccgccgcg gtggtggcgg cggcggcg 58 
atg gat teg ctg cgc etc egg ccg teg etc etc gee gcg egg gee ccc 106 
Met Asp Ser Leu Arg Leu Arg Pro Ser Leu Leu Ala Ala Arg Ala Pro 
1 5 10 is 

ggc gcg gee teg ctg ccg cct etc egg cga gat cac ttt eta cca cct 154 
Gly Ala Ala Ser Leu Pro Pro Leu Arg Arg Asp His Phe Leu Pro Pro 
20 25 30 

tta tgt tct ate cat aga aat ggt aaa egg cca gtt tct ttg tec age 202 
Leu Cys Ser He His Arg Asn Gly Lys Arg Pro Val Ser Leu Ser Ser 
35 40 45 

caa agg acc caa ggt cct tec ttc gat caa tgt cag aaa ttc ttt ggt 250 
Gin Arg Thr Gin Gly Pro Ser Phe Asp Gin Cys Gin Lys Phe Phe Glv 
50 55 60 

tgg aaa tec tec cac cac agg ata cca cat cga cca aca tct agt tec 298 
Trp Lys Ser Ser His His Arg He Pro His Arg Pro Thr Ser Ser Ser 
65 70 75 80 

get gac get teg gga caa cct eta caa tct tea get gaa gca cat gat 346 
Ala Asp Ala Ser Gly Gin Pro Leu Gin Ser Ser Ala Glu Ala His Asp 
85 90 1 95 

tea tea agt ata tgg aag cca ata tea tct tct ccg gat gca ttt tac 394 
Ser Ser Ser He Trp Lys Pro He Ser Ser Ser Pro Asp Ala Phe Tyr 
100 10s no 

agg ttt tct egg cca cat act gtc ata gga aca gca ctt age ata gtc 442 
Arg Phe Ser Arg Pro His Thr Val He Gly Thr Ala Leu Ser He Val 
US 120 125 

tea gtt teg ctg eta get gtt gag aat ttg tec gat gtg tct ccc ttg 4 90 

Ser Val Ser Leu Leu Ala Val Glu Asn Leu Ser Asp Val Ser Pro Leu 
130 135 140 

ttc etc act ggt ttg ctg gag gca gtg gta gca get ctt ttc atg aac 53 8 

Phe Leu Thr Gly Leu Leu Glu Ala Val Val Ala Ala Leu Phe Met Asn 



145 ISO 155 , 160 

ate tat ate gtt gga ttg aat cag ttg ttc gac att gag ata gat aag 
He Tyr He Val Gly Leu Asn Gin Leu Phe Asp He Glu He Asp Lys 
165 170 175 

gtt aac aag cca act ctt cca tta gca tct ggg gaa tat tct cct gca 
Val Asn Lys Pro Thr Leu Pro Leu Ala Ser Gly Glu Tyr Ser Pro Ala 
180 185 190 

act gga gtt gca ctt gta tea gee ttc get get atg age ttt ggc ctt 
Thr Gly Val Ala Leu Val Ser Ala Phe Ala Ala Met Ser Phe Gly Leu 
1*5 200 205 

gga tgg get gtt gga tea cag cct ctg ttc ctg get ctt ttc att age 



586 



634 



682 



730 



WO 00/68393 



16 



PCT/US00/11439 



Gly Trp Ala Val Gly Ser Gin Pro Leu Phe Leu Ala Leu Phe lie Ser 
210 215 220 

ttt att ctt gga aca gca tat teg att aat ctg cca ttc ctg aga tgg 778 
Phe lie Leu Gly Thr Ala Tyr Ser lie Asn Leu Pro Phe Leu Arg Trp 
225 230 235 240 

aag aga tct get gtt gtt gca gca ctt tgc ata tta gca gtc cgt gca 826 
Lys Arg Ser Ala Val Val Ala Ala Leu Cys He Leu Ala Val Arg Ala 
245 250 255 

gtg att gtt cag ctg gca ttt ttt etc cac att cag aca ttc gta ttc 874 
Val He Val Gin Leu Ala Phe Phe Leu His He Gin Thr Phe Val Phe 
260 265 270 

aga aga cca gca gtc ttt ace agg cca ttg att ttt gca act gca ttc 922 
Arg Arg Pro Ala Val Phe Thr Arg Pro Leu He Phe Ala Thr Ala Phe 
275 280 285 

atg acc ttt ttc tec gtt gta ata gca ttg ttc aag gat ata cct gat 970 
Met Thr Phe Phe Ser Val Val He Ala Leu Phe Lys Asp He Pro Asp 
290 295 300 

att gaa gga gac cgt att ttt ggt ate aaa tct ttc agt gtt cga tta 1018 
He Glu Gly Asp Arg He Phe Gly He Lys Ser Phe Ser Val Arg Leu 
305 310 315 320 

ggt caa aag aag gtt ttc tgg att tgt gtt ggt ctg etc gag atg get 1066 
Gly Gin Lys Lys Val Phe Trp He Cys Val Gly Leu Leu Glu Met Ala 
325 330 335 

tat tgt gtt gca ata ttg atg gga get act tct gee tgt ttg tgg age 1114 
Tyr Cys Val Ala He Leu Met Gly Ala Thr Ser Ala Cys Leu Trp Ser 
340 345 350 

aaa tac gca act gtg gtg gga cat gca ate ctt gcg gca ate eta tgg 1162 
Lys Tyr Ala Thr Val Val Gly His Ala He Leu Ala Ala He Leu Trp 
355 360 365 

aac cgc tea egg teg att gat ctg aca age aaa act gca ate act tct 1210 
Asn Arg Ser Arg Ser He Asp Leu Thr Ser Lys Thr Ala He Thr Ser 
370 375 380 

ttc tac atg ttt ate tgg aag ctg ttc tac gcg gaa tac ctt etc att 1258 
Phe Tyr Met Phe lie Trp Lys Leu Phe Tyr Ala Glu Tyr Leu Leu He 
385 390 395 400 

cct ctt gta agg tga caaaggegat tactccaggt agattggaat tggatcatgg 1313 
Pro Leu Val Arg * 



ctggatggat gaaeggaegg cgccccataa aatcacctgc aaatcacccg gtacacatgt 1373 

tgacatcctg catccagata tgatattgat agatcatcgt cggcaccatc attcctctga 1433 

aagatttege aeggcattte aacctccaac tcccaacgta ccccaaaaaa agtaactagg 1493 

ccaggtgagc atetgetage ctatagtaga cgttattgga acagtggtag tacttgttag 1553 

cagcagtaat aataatcatc ataataaagc tctgggttac tgtcaaaaaa aaaaaaaaaa 1613 



<210> 16 

<211> 404 

<212> PRT 

<213> Oryza sativa 
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<400> 16 

Met Asp Ser Leu Arg Leu Arg Pro Ser Leu Leu Ala Ala 

1 5 io 

Gly Ala Ala Ser Leu Pro Pro Leu Arg Arg Asp His Phe 

20 25 
Leu Cys Ser He His Arg Asn Gly Lys Arg Pro Val Ser 

35 40 45 

Gin Arg Thr Gin Gly Pro Ser Phe Asp Gin Cys Gin Lys 

50 55 60 

Trp Lys Ser Ser His His Arg He Pro His Arg Pro Thr 
65 70 75 

Ala Asp Ala Ser Gly Gin Pro Leu Gin Ser Ser Ala Glu 

85 90 
Ser Ser Ser He Trp Lys Pro He Ser Ser Ser Pro Asp 

100 105 
Arg Phe Ser Arg Pro His Thr Val He Gly Thr Ala Leu 
115 120 125 

Ser Val Ser Leu Leu Ala Val Glu Asn Leu Ser Asp Val 

130 135 140 

Phe Leu Thr Gly Leu Leu Glu Ala Val Val Ala Ala Leu 
145 iso 155 

He Tyr He Val Gly Leu Asn Gin Leu Phe Asp He Glu 

165 170 
Val Asn Lys Pro Thr Leu Pro Leu Ala Ser Gly Glu Tyr 

180 185 
Thr Gly Val Ala Leu Val Ser Ala Phe Ala Ala Met Ser 
1^5 200 205 

Gly Trp Ala Val Gly Ser Gin Pro Leu Phe Leu Ala Leu 

210 215 220 

Phe He Leu Gly Thr Ala Tyr Ser He Asn Leu Pro Phe 
225 230 235 

Lys Arg Ser Ala Val Val Ala Ala Leu Cys He Leu Ala 

245 250 
Val He Val Gin Leu Ala Phe Phe Leu His He Gin Thr 

260 265 
Arg Arg Pro Ala Val Phe Thr Arg Pro Leu He Phe Ala 
275 280 285 

Met Thr Phe Phe Ser Val Val He Ala Leu Phe Lys Asp 

290 295 300 

He Glu Gly Asp Arg He Phe Gly He Lys Ser Phe Ser 
305 310 315 

Gly Gin Lys Lys Val Phe Trp He Cys Val Gly Leu Leu 

325 330 
Tyr Cys Val Ala He Leu Met Gly Ala Thr Ser Ala Cys 

340 345 
Lys Tyr Ala Thr Val Val Gly His Ala He Leu Ala Ala 
355 360 365 

Asn Arg Ser Arg Ser He Asp Leu Thr Ser Lys Thr Ala 

370 375 380 

Phe Tyr Met Phe lie Trp Lys Leu Phe Tyr Ala Glu Tyr 
385 390 395 

Pro Leu Val Arg 



Arg Ala Pro 
15 

Leu Pro Pro 
30 

Leu Ser Ser 

Phe Phe Gly 

Ser Ser Ser 
80 

Ala His Asp 
95 

Ala Phe Tyr 
110 

Ser lie Val 

Ser Pro Leu 

Phe Met Asn 
160 

lie Asp Lys 

175 
Ser Pro Ala 
190 

Phe Gly Leu 

Phe lie Ser 

Leu Arg Trp 
240 

Val Arg Ala 

255 
Phe Val Phe 
270 

Thr Ala Phe 

lie Pro Asp 

Val Arg Leu 
320 

Glu Met Ala 

335 
Leu Trp Ser 
350 

lie Leu Trp 

lie Thr Ser 

Leu Leu lie 
400 



<210> 17 

<211> 1733 

<212> DNA 

<213> Oryza Sativa 

<220> 
<221> CDS 
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<222> (1) . . . (1137) 
<400> 17 

ctt aca etc gec tec cct cct etc ccc tgc cgc gee gec gec ace gec 4 8 

Leu Thr Leu Ala Ser Pro Pro Leu Pro Cys Arg Ala Ala Ala Thr Ala 
15 10 15 

age cgc age ggg cgt cct get ccg cgc etc etc ggc cct ccg ccg ccg 96 
Ser Arg Ser Gly Arg Pro Ala Pro Arg Leu Leu Gly Pro Pro Pro Pro 
20 25 30 

ccc get tec cct etc etc tec tec get teg gcg cgc ttc ccg cgt gec 144 
Pro Ala Ser Pro Leu Leu Ser Ser Ala Ser Ala Arg Phe Pro Arg Ala 
35 40 45 

ccc tgc aac gec gca cgc tgg age egg cgc gac gee gtg egg gtt tgc 192 
Pro Cys Asn Ala Ala Arg Trp Ser Arg Arg Asp Ala Val Arg Val Cys 
50 55 60 

tct caa get ggt gca get gga cca gee cca tta teg aag aca ttg tea 240 
Ser Gin Ala Gly Ala Ala Gly Pro Ala Pro Leu Ser Lys Thr Leu Ser 
65 70 75 80 

gac etc aag gat tec tgc tgg aga ttt tta egg cca cat aca att cga 2 88 

Asp Leu Lys Asp Ser Cys Trp Arg Phe Leu Arg Pro His Thr He Arg 
85 90 95 

gga act gec ttg gga tec ata gca tta gtt get aga get ttg ata gag 336 
Gly Thr Ala Leu Gly Ser He Ala Leu Val Ala Arg Ala Leu He Glu 
100 105 110 

aac ccc caa ctg ata aat tgg tgg ttg gta ttc aaa gcg ttc tat ggg 384 
Asn Pro Gin Leu He Asn Trp Trp Leu Val Phe Lys Ala Phe Tyr Gly 
115 120 125 

etc gtg gcg tta ate tgt ggc aat ggt tac ate gtt ggg ate aat cag 432 
Leu Val Ala Leu He Cys Gly Asn Gly Tyr He Val Gly He Asn Gin 
130 135 140 

ate tat gac att aga ate gat aag gta aac aag cca tat tta cca att 480 
He Tyr Asp He Arg He Asp Lys Val Asn Lys Pro Tyr Leu Pro He 
145 150 155 160 

get gee ggt gat etc tea gtt cag aca gca tgg tta ttg gtg gta tta 528 
Ala Ala Gly Asp Leu Ser Val Gin Thr Ala Trp Leu Leu Val Val Leu 
165 170 175 

ttt gca get gcg gga ttt tea att gtt gtg aca aac ttt gga cct ttc 576 
Phe Ala Ala Ala Gly Phe Ser He Val Val Thr Asn Phe Gly Pro Phe 
180 185 190 

att ace tct eta tat tgc ctt ggt eta ttt ctt ggc ace ata tac tct 624 
He Thr Ser Leu Tyr Cys Leu Gly Leu Phe Leu Gly Thr He Tyr Ser 
195 200 205 

gtt cct cca ttc aga ctt aag aga tat cct gtt get get ttt ctt ate 672 
Val Pro Pro Phe Arg Leu Lys Arg Tyr Pro Val Ala Ala Phe Leu He 
210 215 220 

att gca acg gtc cgt ggt ttt ctt etc aac ttt ggt gtg tac tat get 720 
He Ala Thr Val Arg Gly Phe Leu Leu Asn Phe Gly Val Tyr Tyr Ala 
225 230 235 240 
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act aga gca gca ctg ggt ctt aca ttc caa tgg age teg cct gtt get 
Thr Arg Ala Ala Leu Gly Leu Thr Phe Gin Trp Ser Ser Pro Val Ala 
245 250 255 

ttc att aca tgc ttc gtg act tta ttt get ttg gtc att get ata ace 
Phe He Thr Cys Phe Val Thr Leu Phe Ala Leu Val He Ala He Thr 
260 265 270 

aaa gat etc cca gat gtt gaa ggg gat egg aag tat caa ata tea act 
Lys Asp Leu Pro Asp Val Glu Gly Asp Arg Lys Tyr Gin He Ser Thr 
275 280 285 

ttg gcg aca aag etc ggt gtc aga aac att gca ttt ctt ggc tct ggt 
Leu Ala Thr Lys Leu Gly Val Arg Asn He Ala Phe Leu Gly Ser Gly 
290 295 300 

tta ttg ata gca aat tat gtt get get att get gta get ttt etc atg 
Leu Leu He Ala Asn Tyr Val Ala Ala He Ala Val Ala Phe Leu Met 
3°5 310 315 320 

cct cag get ttc agg cgc act gta atg gtg cct gtg cat get gee ctt 
Pro Gin Ala Phe Arg Arg Thr Val Met Val Pro Val His Ala Ala Leu 
325 330 335 

gee gtt ggt ata att ttc cag aca tgg gtt -ctg gag caa gca aaa tat 
Ala Val Gly He He Phe Gin Thr Trp Val Leu Glu Gin Ala Lys Tyr 
340 345 350 

act aag gat get att tea cag tac tac egg ttc att tgg aat etc ttc 
Thr Lys Asp Ala He Ser Gin Tyr Tyr Arg Phe He Trp Asn Leu Phe 
355 360 365 



768 



816 



864 



912 



960 



1008 



1056 



1104 



tat get gaa tac ate ttc ttc ccg ttg ata tag agaccaagca atctgatatg 1157 
Tyr Ala Glu Tyr He Phe Phe Pro Leu He * 
370 375 

gtctgcatgt tgagtgcggc aaaaactaga ageccatatg aacagtggga gtaagggaac 1217 

gaacatgeca tccatgggaa gactctgata actctctctc gcccgggctg taaagggtaa 1277 

gcactgttgt gcatatatat gaaaggaagg tgataaagca gggatgctaa attgetactg 1337 

ggatccttaa aggcttatag tggtcaccag tggaatgtgc cttaataatt tggttaccta 1397 

gcagagcaag tttttgeagg ttattaggta atatctttga gggaatgaac ttagatttca 1457 

ttgttttaag gtctggtcac acaaegggta gtagttctgg ageggcaaaa gacgaccttg 1517 

ttttacacta ccaagggagg ttaactctag ttttcatgtg accacttacc ttgagagttg 1577 

agaccatgga atcacttgtc gactcctcgg cttgtatatt tctagtgtca geatttgeat 1637 

tctcctccac acttgtactt gaagagttga agacaacttt tttgtttgtg tatttctgga 1697 

gtgtcagcat ttgcattcaa aaaaaaaaaa aaaaaa 1733 

<210> 18 
<211> 378 
<212> PRT 

<213> Oryza Sativa 
<400> 18 

Leu Thr Leu Ala Ser Pro Pro Leu Pro Cys Arg Ala Ala Ala Thr Ala 

1 5 10 15 

Ser Arg Ser Gly Arg Pro Ala Pro Arg Leu Leu Gly Pro Pro Pro Pro 

20 25 30 

Pro Ala Ser Pro Leu Leu Ser Ser Ala Ser Ala Arg Phe Pro Arg Ala 

35 40 45 

Pro Cys Asn Ala Ala Arg Trp Ser Arg Arg Asp Ala Val Arg Val Cys 
50 55 60 
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Ser 


Gin 


Ala 


Gly 


Ala 


Ala 


Gly 


Pro 


Ala 


Pro 


Leu 


Ser 


Lys 


Thr 


Leu 


Ser 


65 










70 










75 










80 


Asp 


Leu 


Lys 


Asp 


Ser Cys 


Trp 


Arg 


Phe 


Leu 


Ara 


Pro 


His 


Thr 


He 


Arg 










85 










90 










95 




Gly 


Thr 


Ala 


Leu 


Gly Ser 


He 


Ala 


Leu 


Val 


Ala 


Ara 


Ala 


Leu 


lie 


Glu 








100 










105 










110 






Asn 


Pro 


Gin 


Leu 


He 


Asn 


Trp 


Trp 


Leu 


Val 


Phe 


Lys 


Ala 


Phe 


Tyr Gly 






115 










120 










125 








Leu 


Val 


Ala 


Leu 


He 


Cys 


Glv 


Asn 


Glv 


Tvr 


He 


Val 


Gly 


He 


Asn 


Gin 




130 










135 










140 










lie 


Tyr 


Asp 


He 


Arg 


He 


ASD 


Lvs 


Val 


Asn 


Lys 


Pro 


Tvr 


Leu 


Pro 


He 


145 










150 










155 










160 


Ala 


Ala 


Glv 
i 


Asp 


Leu 


Ser 


Val 


Gin 


Thr 


Ala 


Trn 


Leu 


Leu 


Va 1 
veil 


Val 


Leu 










165 










170 










175 




Phe 


Ala 


Ala 


Ala 


Gly 


Phe 


Ser 


He 


Val 


Val 


Thr 


Asn 


trlXtz 


Gly 


Pro 


Phe 








180 










185 










190 






He 


Thr 


Ser 


Leu 


Tyr 


Cys 


Leu 


Glv 
ui v 


Leu 


Phe 


Leu 




1X11 


He 


Tyr 


Ser 






195 










200 
















Val 


Pro 


Pro 


Phe 


Arg 


Leu 


Lys 


Ara 


Tvr 
j 


Pro 


Val 


Ala 


Ala 


Phe 


Leu 


He 




210 










215 










220 










He 


Ala 


Thr 


Val 


Arg Gly 


Phe 


Leu 


Leu 


Asn 


Phe 


vj-L y 


Val 
vai 


Tyr Tyr Ala 


225 










230 










235 










240 


Thr 


Arq 


Ala 


Ala 


Leu 


Gly 


Leu 


Thr 


Phe 


Gin 






Ser 


Pro 


Val 


Ala 










245 










250 










255 




Phe 


He 


Thr 


Cvs 


Phe 


Val 


Thr 


Leu 


Phe 


Ala 


Leu 


vol 


Tl 
lie 


Ala 


He 


Thr 








260 










265 










270 






Lys 


Asp 


Leu 


Pro 


Asp 


Val 


Glu 


Glv 


Asp 


Ira 


ys 


Tyr 


vjih 


He 


Ser 


Thr 






275 










280 










285 








Leu 


Ala 


Thr 


Lys 


Leu 


Gly 


Val 


Bra 


Asn 


He 


Ala 




Leu 


Gly Ser Gly 




290 










295 










300 










Leu 


Leu 


He 


Ala 


Asn 


Tyr 


Val 


Ala 


Ala 


Tl 
lie 


/\1 d 


Val 


Aj.a 


Phe 


Leu 


Met 


305 










310 










315 










320 


Pro 


Gin 


Ala 


Phe 


Arg 


Arg 


Thr 


Val 


Met 


Val 


Pro 


Val 


His 


Ala 


Ala 


Leu 










325 










330 










335 




Ala 


Val 


Gly 


He 


He 


Phe 


Gin 


Thr 


Trp 


Val 


Leu 


Glu 


Gin 


Ala 


Lys 


Tyr 








340 










345 










350 


Thr 


Lys 


Asp 


Ala 


He 


Ser 


Gin 


Tyr 


Tyr 


Arg 


Phe 


He 


Trp 


Asn 


Leu 


Phe 






355 










360 










365 








Tyr 


Ala 


Glu 


Tyr 


He 


Phe 


Phe 


Pro 


Leu 


He 















370 375 



<210> 19 
<211> 1400 
<212> DNA 
<213> Glycine max 

<220> 
<221> CDS 

<222> (37) . . . (1203) 
<400> 19 

ctgcagggtt ttttcgtttg ctgtgttcag ctcctt atg gag etc tea etc tct 54 

Met Glu Leu Ser Leu Ser 
1 5 

cca act tea cat cgt gtt cct tec aca att ccc act ttg aat ttc get 102 
Pro Thr Ser His Arg Val Pro Ser Thr He Pro Thr Leu Asn Phe Ala 
10 15 20 

aaa eta tea ttc act aag gee aca acg tec caa cct ttg ttc tta gga 150 
Lys Leu Ser Phe Thr Lys Ala Thr Thr Ser Gin Pro Leu Phe Leu Gly 
25 30 35 
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ttt tec aaa cac ttc aac tea att ggg ttg aac cat cac agt tac aga 
Phe Ser Lys His Phe Asn Ser He Gly Leu Asn His His Ser Tyr Arg 
40 4S 50 

tgc tgc tea aat get gtt cct aag aga ccc caa aga ccc agt tec ata 
Cys Cys Ser Asn Ala Val Pro Lys Arg Pro Gin Arg Pro Ser Ser He 
55 60 65 70 



gaa aga ctt tta gat ttg aaa gat get tgc tgg aga ttt tta agg cca 
Glu Arg Leu Leu Asp Leu Lys Asp Ala Cys Trp Arg Phe Leu Arg Pro 
90 95 100 



gca ttg att gag aac acg aat ttg ata aag tgg tct ctt ttg ttc aaa 
Ala Leu He Glu Asn Thr Asn Leu He Lys Trp Ser Leu Leu Phe Lys 
120 125 130 

get ttc tct ggt ctt ttt gee ctg att tgt ggg aat ggt tat ata gtt 
Ala Phe Ser Gly Leu Phe Ala Leu He Cys Gly Asn Gly Tyr lie Val 
135 140 145 150 

ggc ate aat caa ate tat gac att age att gac aag gta aac aaa cct 
Gly He Asn Gin He Tyr Asp He Ser He Asp Lys Val Asn Lys Pro 
155 160 165 

tat tta cct ata get get gga gat ctt tct gtc caa tct gca tgg ttc 
Tyr Leu Pro He Ala Ala Gly Asp Leu Ser Val Gin Ser Ala Trp Phe 
170 175 180 

ttg gtt ata ttt ttt gca gca get ggc ctg teg att gca ggg ttg aac 
Leu Val He Phe Phe Ala Ala Ala Gly Leu Ser He Ala Gly Leu Asn 
185 190 195 

ttt ggg cct ttc att ttt tct ctt tac aca ctt ggc ctt ttc ttg gga 
Phe Gly Pro Phe He Phe Ser Leu Tyr Thr Leu Gly Leu Phe Leu Gly 
200 205 210 



gtg tac tat gee act aga get tec ctt ggg ctt gca ttt gaa tgg age 
Val Tyr Tyr Ala Thr Arg Ala Ser Leu Gly Leu Ala Phe Glu Trp Ser 
250 255 260 

tct cct gtg gtt ttt ate aca aca ttt gta aca ttt ttc gca ctg gta 
Ser Pro Val Val Phe He Thr Thr Phe Val Thr Phe Phe Ala Leu Val 
265 270 275 

att get ata aca aaa gat ctt cct gat gtt gaa ggt gat cgc aag tat 



198 



246 



agg gee tgc act gga gtt gga get get ggt tct gat cgt cca tta get , 294 
Arg Ala Cys Thr Gly Val Gly Ala Ala Gly Ser Asp Arg Pro Leu Ala 
75 80 85 



342 



cat act ata cgt ggt aca gca eta ggt tea ttt get ttg gtg gca aga 390 
His Thr He Arg Gly Thr Ala Leu Gly Ser Phe Ala Leu Val Ala Arg 
105 HO 115 



438 



486 



534 



582 



630 



678 



acc ate tat tct gtt cct cca ttg agg atg aaa cgc ttt cct gtt gca 726 
Thr He Tyr Ser Val Pro Pro Leu Arg Met Lys Arg Phe Pro Val Ala 
215 220 225 230 

gca ttt ctt ata att gee acg gta cgt ggt ttt etc ctt aac ttt ggt 774 
Ala Phe Leu He He Ala Thr Val Arg Gly Phe Leu Leu Asn Phe Gly 
235 240 245 



822 



870 



918 
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lie Ala lie Thr Lys Asp Leu Pro Asp Val Glu Gly Asp Arg Lys Tyr 
280 , 285 290 

cag ata tea acc ttt get aca aaa tta gga gtt egg aac att get ttc 966 
Gin lie Ser Thr Phe Ala Thr Lys Leu Gly Val Arg Asn lie Ala Phe 
295 300 305 310 

ctt ggt tct gga att ttg ctg gtg aat tat att gtt tct gtt ttg gca 1014 
Leu Gly Ser Gly lie Leu Leu Val Asn Tyr He Val Ser Val Leu Ala 
315 320 325 

gca att tat atg cct cag get ttc agg cgt tgg tta etc ata cca get 1062 
Ala He Tyr Met Pro Gin Ala Phe Arg Arg Trp Leu Leu He Pro Ala 
330 335 340 

cat aca att ttt gca ata age ttg att tac cag gca cga ata tta gaa 1110 
His Thr He Phe Ala He Ser Leu He Tyr Gin Ala Arg He Leu Glu 
345 350 355 

caa gca aat tat acc aag gat gca ata tea gga ttc tat cga ttc ata 1158 
Gin Ala Asn Tyr Thr Lys Asp Ala He Ser Gly Phe Tyr Arg Phe He 
360 365 370 

tgg aat ctg ttc tat get gag tat gca ata ttt cct ttc ata tag 1203 
Trp Asn Leu Phe Tyr Ala Glu Tyr Ala He Phe Pro Phe He * 
375 380 385 

caaaccttgc tacttttttc ttgggaaaag gtgcatacgt gcatagttag agagatcttt 1263 

gtttatcaag tgtcaattgg taaactagct atcattattt ttttaaaatg agtattgttg 1323 

tatataaatg tgatactatt tccttttact ttgacgtaat gecattaaca tatttcataa 13B3 

aaaaaaaaaa aaaaaaa 1400 

<210> 20 

<211> 388 

<212> PRT 

<213> Glycine max 

<400> 20 



Met 


Glu 


Leu 


Ser 


Leu 


Ser 


Pro 


Thr 


Ser 


His 


Arg 


Val 


Pro 


Ser 


Thr 


He 


1 








5 










10 










15 




Pro 


Thr 


Leu 


Asn 


Phe 


Ala 


Lys 


Leu 


Ser 


Phe 


Thr 


Lys 


Ala 


Thr 


Thr 


Ser 








20 










25 










30 






Gin 


Pro 


Leu 


Phe 


Leu 


Gly 


Phe 


Ser 


Lys 


His 


Phe 


Asn 


Ser 


He 


Gly 


Leu 






35 










40 










45 






Asn 


His 


His 


Ser 


Tyr 


Arg 


Cys 


Cys 


Ser 


Asn 


Ala 


Val 


Pro 


Lys 


Arg 


Pro 




50 










55 










60 








Gin 


Arg 


Pro 


Ser 


Ser 


He 


Arg 


Ala 


Cys 


Thr 


Gly Val 


Gly 


Ala 


Ala 


Gly 


65 










70 










75 










80 


Ser 


Asp 


Arg 


Pro 


Leu 


Ala 


Glu 


Arg 


Leu 


Leu 


Asp 


Leu 


Lys 


Asp 


Ala 


Cys 










85 










90 










95 




Trp 


Arg 


Phe 


Leu 


Arg 


Pro 


His 


Thr 


He 


Arg 


Gly Thr 


Ala 


Leu 


Gly 


Ser 








100 










105 










110 




Phe 


Ala 


Leu 


Val 


Ala 


Arg 


Ala 


Leu 


He 


Glu 


Asn 


Thr 


Asn 


Leu 


He 


Lys 






115 










120 










125 






Trp 


Ser 


Leu 


Leu 


Phe 


Lys 


Ala 


Phe 


Ser 


Gly 


Leu 


Phe 


Ala 


Leu 


He 


Cys 




130 










135 










140 








Gly 


Asn 


Gly 


Tyr 


He 


Val 


Gly 


He 


Asn 


Gin 


He 


Tyr 


Asp 


He 


Ser 


He 


145 










150 










155 










160 


Asp 


Lys 


Val 


Asn 


Lys 


Pro 


Tyr 


Leu 


Pro 


He 


Ala 


Ala 


Gly 


Asp 


Leu 


Ser 










165 










170 










175 




Val 


Gin 


Ser 


Ala 


Trp 


Phe 


lieu 


Val 


He 


Phe 


Phe 


Ala 


Ala 


Ala 


Gly 


Leu 








180 










185 










190 
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Car 


T 1 A 


Aia 


Gly 


Leu 


Asn Phe 














Leu 


Gly 


Leu 


Phe 


Leu Gly Thr 




2 10 








215 


Lys 


Arg 


Pne 


Pro 


Val 


Ala Ala 












230 


rile 


Leu 


Leu 


Asn 


Phe Gly Val 










245 




T i 

lieu 


Ala 


Phe 


Glu 


Trp 


Ser Ser 








260 






Thr 


Phe 


Phe 


Ala 


Leu 


Val He 






275 








Glu 


Gly 


Asp Arg 


Lys 


Tyr Gin 




290 








295 


Val 


Arg 


Asn 


He 


Ala 


Phe Leu 












310 


lie 


Val 


Ser 


Val 


Leu 


Ala Ala 










325 




Trp 


Leu 


Leu 


He 


Pro 


Ala His 








340 






Gin 


Ala 


Arg 


He 


Leu 


Glu Gin 






355 








Gly 


Phe 


Tyr Arg 


Phe 


He Trp 




370 








375 


Phe 


Pro 


Phe 


He 






385 













200 205 
lie Tyr Ser Val Pro Pro Leu Arg Met 
220 

Phe Leu He He Ala Thr Val Arg Gly 
235 240 
Tyr Tyr Ala Thr Arg Ala Ser Leu Gly 

250 255 
Pro Val Val Phe He Thr Thr Phe Val 

265 270 
Ala He Thr Lys Asp Leu Pro Asp Val 
280 285 
He Ser Thr Phe Ala Thr Lys Leu Gly 
300 

Gly Ser Gly He Leu Leu Val Asn Tyr 
315 320 
He Tyr Met Pro Gin Ala Phe Arg Arg 

330 335 
Thr lie Phe Ala lie Ser Leu lie Tyr 

345 350 
Ala Asn Tyr Thr Lys Asp Ala lie Ser 
360 365 
Asn Leu Phe Tyr Ala Glu Tyr Ala He 
380 



<210> 21 

<211> 1370 

<212> DNA 

<213> Glycine max 

<220> 
<221> CDS 

<222> (24) ... (1211) 
<400> 21 

gcacgagagc actactgtta tat atg gat teg atg ctt ctt cga tct ttt cct 53 

Met Asp Ser Met Leu Leu Arg Ser Phe Pro 
1 5 io 

aat att aac aac get tct tct etc gee acc act ggt tct tat ttg cca 101 
Asn lie Asn Asn Ala Ser Ser Leu Ala Thr Thr Gly Ser Tyr Leu Pro 
15 20 25 

aat get tea tgg cac aat agg aaa ate caa aaa gaa tat aat ttt ttg 149 
Asn Ala Ser Trp His Asn Arg Lys lie Gin Lys Glu Tyr Asn Phe Leu 
30 35 40 

agg ttt egg tgg cca agt ttg aac cac cat tac aaa age att gaa gga 197 
Arg Phe Arg Trp Pro Ser Leu Asn His His Tyr Lys Ser lie Glu Gly 
45 50 55 

999 tgt aca tgt aaa aaa tgt aat ata aaa ttt gtt gtg aaa gcg acc 245 
Gly Cys Thr Cys Lys Lys Cys Asn lie Lys Phe Val Val Lys Ala Thr 
60 65 70 

tct gaa aaa tct ttt gag tct gaa cct caa get ttt gat cca aaa age 293 
Ser Glu Lys Ser Phe Glu Ser Glu Pro Gin Ala Phe Asp Pro Lys Ser 
75 80 es ^ 90 

att ttg gac tct gtc aag aat tec ttg gat get ttc tac agg ttt tec 341 
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lie Leu Asp Ser Val Lys Asn Ser Leu Asp* Ala Phe Tyr Arg Phe Ser 

■ 'r 95 100 105 

aga cct cac aca gtt att ggc aca gca tta age ata att tct gtg tec 389 
Arg Pro His Thr Val lie Gly Thr Ala Leu Ser lie lie Ser Val Ser 
110 115 120 

etc ctt get gtt gag aaa ata tea gat ata tct cca tta ttt ttt act 437 
Leu Leu Ala Val Glu Lys He Ser Asp He Ser Pro Leu Phe Phe Thr 
125 130 135 

99t gtg ttg gag get gtg gtt get gec ctg ttt atg aat att tat att 4 85 

Gly Val Leu Glu Ala Val Val Ala Ala Leu Phe Met Asn He Tyr He 
140 145 150 

gtt ggt ttg aat caa ttg tct gat gtt gaa ata gac aag ata aac aag 533 
Val Gly Leu Asn Gin Leu Ser Asp Val Glu He Asp Lys He Asn Lys 
155 160 165 170 

ccg tat ctt cca tta gca tct ggg gaa tat tec ttt gaa act ggt gtc 581 
Pro Tyr Leu Pro Leu Ala Ser Gly Glu Tyr Ser Phe Glu Thr Gly Val 
175 180 185 

act att gtt gca tct ttt tea att ctg agt ttt tgg ctt ggc tgg gtt 629 
Thr He Val Ala Ser Phe Ser He Leu Ser Phe Trp Leu Gly Trp Val 
190 195 200 

gta ggt tea tgg cca tta ttt tgg gec ctt ttt gta age ttt gtg eta 677 
Val Gly Ser Trp Pro Leu Phe Trp Ala Leu Phe Val Ser Phe Val Leu 
205 210 215 

gga act get tat tea ate aat gtg cct ctg ttg aga tgg aag agg ttt 725 
Gly Thr Ala Tyr Ser He Asn Val Pro Leu Leu Arg Trp Lys Arg Phe 
220 225 230 

gca gtg ctt gca gcg atg tgc att eta get gtt egg gca gta ata gtt 773 
Ala Val Leu Ala Ala Met Cys He Leu Ala Val Arg Ala Val He Val 
235 240 245 250 

caa ctt gca ttt ttc ctt cac ate cag act cat gta tac aag agg cca 821 
Gin Leu Ala Phe Phe Leu His He Gin Thr His Val Tyr Lys Arg Pro 
255 260 265 

cct gtc ttt tea aga tea ttg att ttt get act gca ttc atg age ttc 869 
Pro Val Phe Ser Arg Ser Leu He Phe Ala Thr Ala Phe Met Ser Phe 
270 275 280 

ttc tct gta gtt ata gca ctg ttt aag gat ata cct gac att gaa gga 917 
Phe Ser Val Val He Ala Leu Phe Lys Asp He Pro Asp He Glu Gly 
285 290 295 

gat aaa gta ttt ggc ate caa tct ttt tea gtg cgt tta ggt cag aag 965 
Asp Lys Val Phe Gly He Gin Ser Phe Ser Val Arg Leu Gly Gin Lys 
300 305 310 

ccg gta ttc tgg act tgt gtt ate ctt ctt gaa ata get tat gga gtc 1013 
Pro Val Phe Trp Thr Cys Val He Leu Leu Glu He Ala Tyr Gly Val 
315 320 325 330 

gee etc ctg gtg gga get gca tct cct tgt ctt tgg age aaa att gtc 1061 
Ala Leu Leu Val Gly Ala Ala Ser Pro Cys Leu Trp Ser Lys He Val 
335 340 345 
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acg ggt ctg gga cac get gtt ctg get tea att etc tgg ttt cat gee 1109 
Thr Gly Leu Gly His Ala Val Leu Ala Ser He Leu Trp Phe His Ala 
350 355 360 

aaa tct gta gat ttg aaa age aaa get teg ata aca tec ttc tat atg 1157 
Lys Ser Val Asp Leu Lys Ser Lys Ala Ser He Thr Ser Phe Tyr Met 
365 370 375 

ttt att tgg aag eta ttt tat gca gaa tac tta etc att cct ttt gtt 1205 
Phe He Trp Lys Leu Phe Tyr Ala Glu Tyr Leu Leu He Pro Phe Val 
380 385 390 

aga tga ggatgcagcg gcaatattga cttgagaatt agttttgttt aaatggtgct 1261 

Arg * 

395 

gectttgtea caggccggct tggagtcget acattagttt taagttttta attgetaatt 1321 
taaatgaaga tatatttctt ttgggatgaa aaaaaaaaaa aaaaaaaaa 1370 

<210> 22 

<211> 395 

<212> PRT 

<213> Glycine max 

<400> 22 



Met 


Asp 


Ser 


Met 


Leu 


Leu 


Arg 


Ser 


Phe 


Pro 


Asn He Asn Asn Ala Ser 


1 








5 










10 


15 


Ser 


Leu 


Ala 


Thr 
20 


Thr 


Gly 


Ser 


Tyr 


Leu 
25 


Pro 


Asn Ala Ser Trp His Asn 
30 


Arg 


Lys 


He 
35 


Gin 


Lys 


Glu 


Tyr 


Asn 
40 


Phe 


Leu 


Arg Phe Arg Trp Pro Ser 
45 


Leu 


Asn 
50 


His 


His 


Tyr 


Lys 


Ser 
55 


He 


Glu 


Gly 


Gly Cys Thr Cys Lys Lys 
60 


Cys 


Asn 


lie 


Lys 


Phe 


Val 


Val 


Lys 


Ala 


Thr 


Ser Glu Lys Ser Phe Glu 


65 










70 










75 80 


Ser 


Glu 


Pro 


Gin 


Ala 
85 


Phe 


Asp 


Pro 


Lys 


Ser 
90 


He Leu Asp Ser Val Lys 
95 


As n 


Ser 


Leu Asp 


Ala 


Phe 


Tyr 


Arg 


Phe 


Ser 


Arg Pro His Thr Val He 








100 










105 




110 


Gly 


Thr 


Ala 
115 


Leu 


Ser 


He 


He 


Ser 
120 


Val 


Ser 


Leu Leu Ala Val Glu Lys 
125 


He 


Ser 
130 


Asp 


He 


Ser 


Pro 


Leu 
135 


Phe 


Phe 


Thr 


Gly Val Leu Glu Ala Val 
140 


Val 


Ala 


Ala 


Leu 


Phe 


Met 


Asn 


He 


Tyr 


He 


Val Gly Leu Asn Gin Leu 


145 










150 










155 160 


Ser 


Asp 


Val 


Glu 


He 
165 


Asp 


Lys 


He 


Asn 


Lys 
170 


Pro Tyr Leu Pro Leu Ala 
175 


Ser 


Gly 


Glu Tyr 


Ser 


Phe 


Glu 


Thr Gly 


Val 


Thr He Val Ala Ser Phe 








180 










185 




190 


Ser 


He 


Leu 


Ser 


Phe 


Trp 


Leu 


Gly Trp 


Val 


Val Gly Ser Trp Pro Leu 






195 










200 






205 


Phe 


Trp 
210 


Ala 


Leu 


Phe 


Val 


Ser 
215 


Phe 


Val 


Leu 


Gly Thr Ala Tyr Ser He 
220 


Asn 


Val 


Pro 


Leu 


Leu 


Arg 


Trp 


Lys Arg 


Phe 


Ala Val Leu Ala Ala Met 


225 










230 










235 240 


Cys 


He 


Leu 


Ala 


Val 


Arg 


Ala 


Val 


He 


Val 


Gin Leu Ala Phe Phe Leu 


His 








245 










250 


255 


He 


Gin 


Thr 
260 


His 


Val 


Tyr 


Lys 


Arg 
265 


Pro 


Pro Val Phe Ser Arg Ser 
270 


Leu 


He 


Phe 
275 


Ala 


Thr 


Ala 


Phe 


Met 
280 


Ser 


Phe 


Phe Ser Val Val He Ala 
285 
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Leu 

Gin 
305 
Val 

Ala 

Val 

Ser 

Tyr 
385 



Phe Lys Asp 

2 90;. 

Ser Phe Ser 

He Leu Leu 

Ser Pro Cys 
340 

Leu Ala Ser 

355 
Lys Ala Ser 
370 

Ala Glu Tyr 



He Pro Asp 
295 

Val Arg Leu 

310 
Glu lie Ala 
325 

Leu Trp Ser 

He Leu Trp 

lie Thr Ser 
375 

Leu Leu He 
390 



He Glu Gly Asp 
Gly Gin 
Tyr Gly 



Lys lie 
345 
Phe His 
360 

Phe Tyr 
Pro Phe 



Lys Pro 
315 
Val Ala 
330 

Val Thr 



Ala Lys 

Met Phe 

Val Arg 
395 



Lys Val Phe Gly lie 
300 

Val Phe Trp Thr Cys 
320 

Leu Leu Val Gly Ala 
335 

Gly Leu Gly His Ala 
350 

Ser Val Asp Leu Lys 
365 

lie Trp Lys Leu Phe 
380 



<210> 23 
<211> 1575 
<212> DNA 
<213> Glycine max 

<220> 
<221> CDS 

<222> (109) . . . (1338) 
<400> 23 

cgaggagaga gagagaacta gtctcgagtt tagtctctac aatcactcct tcctctcatc 60 
ctctataaag aaagtgctta atttgtgttg ttacttggtt cagtttcc atg gat tgg 117 

Met Asp Trp 
1 



ggg ctt get ata tct tct cat cct aaa cct tat tea gtc aca act ggt 165 
Gly Leu Ala He Ser Ser His Pro Lys Pro Tyr Ser Val Thr Thr Gly 
S 10 15 



gga aat etc tgg egg agt aaa cac acc acc aag aat att tac ttt gca 213 
Gly Asn Leu Trp Arg Ser Lys His Thr Thr Lys Asn He Tyr Phe Ala 
20 25 30 35 



agt tct tgg ata tea aaa get tea cga cac aaa agg gaa act caa ata 261 
Ser Ser Trp lie Ser Lys Ala Ser Arg His Lys Arg Glu Thr Gin lie 
40 45 50 



gaa cat aat gtt ttg agg ttc caa caa cca agt ttg gat cat cat tac 309 
Glu His Asn Val Leu Arg Phe Gin Gin Pro Ser Leu Asp His His Tyr 
55 60 65 



aaa tgc ate aga gga ggg tct aca tat caa gaa tgc aat aga aaa ttt 357 
Lys Cys He Arg Gly Gly Ser Thr Tyr Gin Glu Cys Asn Arg Lys Phe 
70 75 80 

gtt gtg aag gca ate tct aaa caa cct ctt ggt ttt gaa get cat get 405 
Val Val Lys Ala lie Ser Lys Gin Pro Leu Gly Phe Glu Ala His Ala 
85 90 95 

tec aat cct aag aac att ttg gac tct gtc aaa aat gta ttg tct get 453 
Ser Asn Pro Lys Asn lie Leu Asp Ser Val Lys Asn Val Leu Ser Ala 
100 105 110 lis 



ttc tac tgg ttt tec tat cca tac aca atg att ggc ata aca tta tgc 
Phe Tyr Trp Phe Ser Tyr Pro Tyr Thr Met lie Gly He Thr Leu Cys 
120 125 130 



501 
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gca ttt tct tea tct ctt etc gcg gtg gaa aaa tta tea gat ata tct 
Ala Phe Ser Ser Ser Leu Leu Ala Val Glu Lys Leu Ser Asp lie Ser 
135 140 145 



549 



tta tea ttt tta att ggc gtg tta cag ggt gtg ctg cct caa ttg ttt 597 
Leu Ser Phe Leu lie Gly Val Leu Gin Gly Val Leu Pro Gin Leu Phe 
150 155 i6o 

att gaa att tat ctt tgt ggt gtg aat caa ctg tat gac ctt gaa ata 645 
He Glu He Tyr Leu Cys Gly Val Asn Gin: Leu Tyr Asp Leu Glu He 
165 170 175 

gac aag ata aac aaa cca cat ctt cca atg gca tct gga caa ttt tec 693 
Asp Lys He Asn Lys Pro His Leu Pro Met Ala Ser Gly Gin Phe Ser 
180 185 190 i 9 5 

ttt aaa ace ggt gtc att att tct gca gca ttt tta get ctg agt ttt 741 
Phe Lys Thr Gly Val He He Ser Ala Ala Phe Leu Ala Leu Ser Phe 
200 205 210 

gga ttt act tgg att acc ggc tct tgg cca ttg att tgt aat ctt gta 789 
Gly Phe Thr Trp He Thr Gly Ser Trp Pro Leu He Cys Asn Leu Val 
215 220 225 



837 



885 



933 



981 



1029 



gta ate get tea teg tgg acg get tat tea ate gat gtg ccc eta ctg 
Val He Ala Ser Ser Trp Thr Ala Tyr Ser lie Asp Val Pro Leu Leu 
230 235 240 

aga tgg aag aga tac cca ttt gtc gca gca atg tgc atg att tct act 
Arg Trp Lys Arg Tyr Pro Phe Val Ala Ala Met Cys Met He Ser Thr 
245 250 255 

tgg get ctt gca ttg cca att tea tat ttc cat cac atg cag acc gtt 
Trp Ala Leu Ala Leu Pro He Ser Tyr Phe His His Met Gin Thr Val 
260 265 270 2 75 

gtg ttg aag agg cca att ggc ttt cca aga tea ttg ggt ttt ctt gtt 
Val Leu Lys Arg Pro He Gly Phe Pro Arg Ser Leu Gly Phe Leu Val 
280 285 290 

gca ttc atg acc ttc tac tec ttg ggt ttg gca ttg tec aag gat ata 
Ala Phe Met Thr Phe Tyr Ser Leu Gly Leu Ala Leu Ser Lys Asp He 
295 300 305 

cct gac gtt gaa gga gat aaa gag cac ggc att gat tct ttt gca gta 1077 
Pro Asp Val Glu Gly Asp Lys Glu His Gly He Asp Ser Phe Ala Val 
310 315 320 

cgt eta ggt cag aaa egg gca ttt tgg att tgc gtt tec ttt ttt gaa 1125 
Arg Leu Gly Gin Lys Arg Ala Phe Trp lie Cys Val Ser Phe Phe Glu 
325 330 335 

atg get ttc gga gtt ggt ate ctg gee gga gca tea tgc tea cac ttt 1173 
Met Ala Phe Gly Val Gly He Leu Ala Gly Ala Ser Cys Ser His Phe 
340 345 3S0 355 

tgg act aaa att ttc acg ggt atg gga aat get gtt ctt get tea att 1221 
Trp Thr Lys lie Phe Thr Gly Met Gly Asn Ala Val Leu Ala Ser lie 
360 365 370 

etc tgg tac caa gee aag tec gta gat ttg age gac aaa get tec act 1269 
Leu Trp Tyr Gin Ala Lys Ser Val Asp Leu Ser Asp Lys Ala Ser Thr 
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375 380 385 

gga tct ttc tat atg ttc ate tgg aag eta ttg tat gca ggg ttc ttt 1317 
Gly Ser Phe Tyr Met Phe lie Trp Lys Leu Leu Tyr Ala Gly Phe Phe 
390 395 400 

etc atg gca tta att aga tga ggatatcgtg gaaggcttaa acaatgttct 13 68 
Leu Met Ala Leu lie Arg * 
405 

cgacacatac accaaaataa aaggaatata tgttttgcat ctaagattta ttaaataaag 1428 

ccgaatgttg gttcttgtat cattaagatt ttttttttaa ttgtcgaaga ctttatgtat 1488 

tcatattcac cttgacttct aeggtcaaat ttttcataaa gtggaataaa agcaacttgg 154 8 

tatacaaaaa aaaaaaaaaa aaaaaaa 1575 

<210> 24 

<211> 409 

<212> PRT 

<213> Glycine max 





<400> 


24 


























Met 


Asp 


Trp 


Gly 


Leu 


Ala 


lie 


Ser 


Ser 


His 


Pro 


Lys 


Pro 


Tyr 


Ser 


Val 


1 








5 










10 










15 




Thr 


Thr 


Gly 


Gly 


Asn 


Leu 


Trp 


Arg 


Ser 


Lys 


His 


Thr 


Thr 


Lys 


Asn 


He 








20 










25 










30 






Tyr 


Phe 


Ala 


Ser 


Ser 


Trp 


He 


Ser 


Lys 


Ala 


Ser 


Arg 


His 


Lys 


Arg 


Glu 






35 










40 










45 








Thr 


Gin 


He 


Glu 


His 


Asn 


Val 


Leu 


Arg 


Phe 


Gin 


Gin 


Pro 


Ser 


Leu 


Asp 




50 










55 










60 










His 


His 


Tyr 


Lys 


Cys 


He 


Arg 


Gly 


Gly 


Ser 


Thr 


Tyr 


Gin 


Glu 


Cys 


Asn 


65 










70 










7S 










80 


Arg 


Lys 


Phe 


Val 


Val 


Lys 


Ala 


He 


Ser 


Lys 


Gin 


Pro 


Leu 


Gly 


Phe 


Glu 










85 










90 










95 




Ala 


His 


Ala 


Ser 


Asn 


Pro 


Lys 


Asn 


He 


Leu 


Asp 


Ser 


Val 


Lys 


Asn 


Val 








100 










105 










110 






Leu 


Ser 


Ala 


Phe 


Tyr 


Trp 


Phe 


Ser 


Tyr 


Pro 


Tyr 


Thr 


Met 


He 


Gly 


He 






115 










120 










125 








Thr 


Leu 


Cys 


Ala 


Phe 


Ser 


Ser 


Ser 


Leu 


Leu 


Ala 


Val 


Glu 


Lys 


Leu 


Ser 




130 










135 










140 










Asp 


He 


Ser 


Leu 


Ser 


Phe 


Leu 


He 


Gly 


Val 


Leu 


Gin 


Gly 


Val 


Leu 


Pro 


145 










150 










155 










160 


Gin 


Leu 


Phe 


He 


Glu 


He 


Tyr 


Leu 


Cys 


Gly 


Val 


Asn 


Gin 


Leu 


Tyr 


Asp 










165 










170 










175 




Leu 


Glu 


He 


Asp 


Lys 


He 


Asn 


Lys 


Pro 


His 


Leu 


Pro 


Met 


Ala 


Ser 


Gly 








180 










185 










190 






Gin 


Phe 


Ser 


Phe 


Lys 


Thr 


Gly 


Val 


He 


He 


Ser 


Ala 


Ala 


Phe 


Leu 


Ala 






195 










200 










205 








Leu 


Ser 


Phe 


Gly 


Phe 


Thr 


Trp 


He 


Thr 


Gly 


Ser 


Trp 


Pro 


Leu 


He 


Cys 




210 










215 










220 










Asn 


Leu 


Val 


Val 


He 


Ala 


Ser 


Ser 


Trp 


Thr 


Ala 


Tyr 


Ser 


He 


Asp 


Val 


225 










230 










235 










240 


Pro 


Leu 


Leu 


Arg 


Trp 


Lys 


Arg 


Tyr 


Pro 


Phe 


Val 


Ala 


Ala 


Met 


Cys 


Met 










245 










250 










255 




He 


Ser 


Thr 


Trp 


Ala 


Leu 


Ala 


Leu 


Pro 


He 


Ser 


Tyr 


Phe 


His 


His 


Met 








260 










265 










270 






Gin 


Thr 


Val 


Val 


Leu 


Lys 


Arg 


Pro 


He 


Gly 


Phe 


Pro 


Arg 


Ser 


Leu 


Gly 






275 










280 










285 








Phe 


Leu 


val 


Ala 


Phe 


Met 


Thr 


Phe 


Tyr 


Ser 


Leu 


Gly 


Leu 


Ala 


Leu 


Ser 




290 










295 










300 










Lys 


Asp 


He 


Pro 


Asp 


Val 


Glu 


Gly 


Asp 


Lys 


Glu 


His 


Gly 


He 


Asp 


Ser 


305 










310 










315 










320 


Phe 


Ala 


Val 


Arg 


Leu 


Gly 


Gin 


Lys 


Arg 


Ala 


Phe 


Trp 


He 


Cys 


Val 


Ser 
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325 










330 




335 


Phe 


Phe 


Glu 


Met 


Ala 


Phe 


Gly Val 


Gly 


He 


Leu 


Ala Glv Ala Ser fvc 








340 










345 








Ser 


His 


Phe 


Trp 


Thr 


Lys 


He 


Phe 


Thr Gly Met 


Gly Asn Ala Val Leu 






35S 










360 








365 


Ala 


Ser 
370 


lie 


Leu 


Trp 


Tyr 


Gin 
375 


Ala 


Lys 


Ser 


Val 


Asp Leu Ser Asp Lys 
380 


Ala 


Ser Thr Gly 


Ser 


Phe 


Tyr 


Met 


Phe 


He 


Trp 


Lys Leu Leu Tyr Ala 


385 










390 










395 


400 


Gly Phe 


Phe 


Leu 


Met 


Ala 


Leu 


He 


Arg 
















405 

















<210> 25 
<211> 368 
<212> DNA 

<213> Triticum aestivum 

<220> 

<221> CDS 

<222> <1)...(363) 

<400> 25 

gca aca ttg ttc atg tgt tgc ttc tct gcc gtc ata get eta ttc aag 4 8 

Ala Thr Leu Phe Met Cys Cys Phe Ser Ala Val He Ala Leu Phe Lys 
15 10 is 



gat att cct gat gtt gat gga gac cga gat ttt ggc ate caa tec ttg 
Asp He Pro Asp Val Asp Gly Asp Arg Asp Phe Gly He Gin Ser Leu 
20 25 30 



cac eta ctt caa aag ate ate act gtg tct ggc cat ggc ctg ctt get 
His Leu Leu Gin Lys He He Thr Val Ser Gly His Gly Leu Leu Ala 
65 70 , 75 80 



tat ttc ctt ata ccg ttt gtg caa taa aattt 
Tyr Phe Leu He Pro Phe Val Gin * 
115 120 



96 



agt gtg aga ttg ggg cca caa aga gtg tat cag etc tgc ata age ata 144 
Ser Val Arg Leu Gly Pro Gin Arg Val Tyr Gin Leu Cys He Ser He 
35 40 45 

ctg tta aca gcc tat ggg get gcc act gta gta gga get tea tec aca 192 
Leu Leu Thr Ala Tyr Gly Ala Ala Thr Val Val Gly Ala Ser Ser Thr 
50 55 60 



240 



gtg aca ctt tgg cag aga gcg egg cac ctt gag gtt gaa aac caa gcg 288 

Val Thr Leu Trp Gin Arg Ala Arg His Leu Glu Val Glu Asn Gin Ala 
85 90 95 

cgt gtc aca tea ttt tac atg ttc att tgg aag eta ttc tat gca aag 336 

Arg Val Thr Ser Phe Tyr Met Phe He Trp Lys Leu Phe Tyr Ala Lys 
100 105 no 



368 



<210> 26 
<211> 120 
<212> PRT 

<213> Triticum aestivum 



<400> 26 

Ala Thr Leu Phe Met Cys Cys Phe Ser Ala Val He Ala Leu Phe Lys 
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1 






5 




10 






15 




Asp 


He 


.Pro 


Asp Val 


Asp Gly Asp Arg 


Asp Phe Gly 


He 


Gin 


Ser 


Leu 








20 


25 






30 






Ser 


Val 


Arg 


Leu Gly 


Pro Gin Arg Val 


Tyr Gin Leu 


Cys 


He 


Ser 


He 






35 




40 




45 








Leu 


Leu 


Thr 


Ala Tyr 


Gly Ala Ala Thr 


Val Val Gly 


Ala 


Ser 


Ser 


Thr 




50 






55 


60 










His 


Leu 


Leu 


Gin Lys 


He He Thr Val 


Ser Gly His 


Gly 


Leu 


Leu 


Ala 


65 








70 


75 








60 


Val 


Thr 


Leu 


Trp Gin 


Arg Ala Arg His 


Leu Glu Val 


Glu 


Asn 


Gin 


Ala 








85 




90 






95 




Arg 


Val 


Thr 


Ser Phe 


Tyr Met Phe He 


Trp Lys Leu 


Phe 


Tyr 


Ala 


Lys 








100 


105 






110 




Tyr 


Phe 


Leu 


He Pro 


Phe Val Gin 
















115 




120 













<210> 27 
<211> 1477 
<212> DNA 

<213> Triticum asestivum 

<220> 
<221> CDS 

<222> (20) . . . (1171) 
<400> 27 

cacgagcccc tccccaccc atg get tec etc gee tec cct ccc gtc ccc tec 52 

Met Ala Ser Leu Ala Ser Pro Pro Val Pro Ser 
1 5 10 

cac gcg ccc ace acc gec get cgc ttc etc ccc gcg ccg gec ggc cgc 100 
His Ala Pro Thr Thr Ala Ala Arg Phe Leu Pro Ala Pro Ala Gly Arg 
15 20 25 

ggc agg cgc ccg teg ccg ccg gec get tea cct ate ttc tec tct get 148 
Gly Arg Arg Pro Ser Pro Pro Ala Ala Ser Pro He Phe Ser Ser Ala 
30 35 40 

tec acc cga ttc acc cag tec ccg cgc gee ccc tgc ggc gec gee cga 196 
Ser Thr Arg Phe Thr Gin Ser Pro Arg Ala Pro Cys Gly Ala Ala Arg 
45 50 55 

ccg cgc tgg cgc gac acc gtg egg gca tgc tct caa get ggt gca get 244 
Pro Arg Trp Arg Asp Thr Val Arg Ala Cys Ser Gin Ala Gly Ala Ala 
60 65 70 75 

ggg cca get cca ctg tea aag aca tta tea gac eta aag gat tec tgc 292 
Gly Pro Ala Pro Leu Ser Lys Thr Leu Ser Asp Leu Lys Asp Ser Cys 
80 85 90 

tgg aga ttt tta agg cca cac aca att cgt gga act get ttg gga tec 340 
Trp Arg Phe Leu Arg Pro His Thr He Arg Gly Thr Ala Leu Gly Ser 
95 100 105 

aca gec ttg gtt get aga gca tta tta gag aat ccc caa ttg ate gat 388 
Thr Ala Leu Val Ala Arg Ala Leu Leu Glu Asn Pro Gin Leu He Asp 
110 115 120 

tgg cgc ttg gta ttc aaa gca tta tat ggc ctt gta get ttg ate tgc 436 
Trp Arg Leu Val Phe Lys Ala Leu Tyr Gly Leu Val Ala Leu He Cys 
125 130 135 
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ggc aac ggt tac att gtt ggg att aat cag ate tat gac att gga att 4 84 

Gly Asn Gly Tyr He Val Gly He Asn Gin He Tyr Asp He Gly He 
140 145 150 155 

gac aag gta aac aaa cca tat tta cct att get gee ggt gat etc tea 532 
Asp Lys Val Asn Lys Pro Tyr Leu Pro He Ala Ala Gly Asp Leu Ser 
160 165 170 

gtt cag tea gca tgg tta ctg gtc gta gca ttc gca gtg gtg ggc ttc 560 
Val Gin Ser Ala Trp Leu Leu Val Val Ala Phe Ala Val Val Gly Phe 
175 180 185 

tea ata gtc gtt tea aac ttt gga cct ttc ate acc tct ctt tac tgc 628 
Ser He Val Val Ser Asn Phe Gly Pro Phe lie Thr Ser Leu Tyr Cys 
190 195 200 

ctt ggt eta ttt ctt ggc act ata tat tct gtt cct cca ttc aga ctg 676 
Leu Gly Leu Phe Leu Gly Thr He Tyr Ser Val Pro Pro Phe Arg Leu 
205 210 215 

aag aga tat cca gtt get get ttt ctt ate att gcg acg gtt cgt gga 724 
Lys Arg Tyr Pro Val Ala Ala Phe Leu lie lie Ala Thr Val Arg Gly 
220 225 230 235 

ttc ctt etc aac ttt ggg gtg tac tat get act aga get gca tta ggt 772 
Phe Leu Leu Asn Phe Gly Val Tyr Tyr Ala Thr Arg Ala Ala Leu Gly 
240 245 250 

ctt aca ttc caa tgg age teg ccc gtt get ttt att aca tgc ttt gtg 820 
Leu Thr Phe Gin Trp Ser Ser Pro Val Ala Phe He Thr Cys Phe Val 
255 260 265 

aca gta ttt get ctg gtc att get ata acc aaa gat ctt ccg gat gtt 868 
Thr Val Phe Ala Leu Val He Ala lie Thr Lys Asp Leu Pro Asp Val 
270 275 280 



gaa ggg gac cgc aaa ttc caa ata . tea act ttg gcg aca aag ctt ggt 916 

Glu Gly Asp Arg Lys Phe Gin lie Ser Thr Leu Ala Thr Lys Leu Gly 
285 290 295 

gtc aga aat att gec ttc ctt ggc tct ggt tta ttg ttg gca aat tat 964 

Val Arg Asn lie Ala Phe Leu Gly Ser Gly Leu Leu Leu Ala Asn Tyr 

300 305 310 315 

gtt gtt get att gta gta cct ttt ctt att cct cag get ttc agg age 1012 

Val Val Ala lie Val Val Pro Phe Leu lie Pro Gin Ala Phe Arg Ser 
320 325 330 

ttt gta atg gtg cct ttt cat get get ctt gca gtt get tta att ttt 1060 

Phe Val Met Val Pro Phe His Ala Ala Leu Ala Val Ala Leu lie Phe 
335 340 345 

cag aca tgg gtt ctg gag caa gca aag tac agt aag gat get att tea 1108 

Gin Thr Trp Val Leu Glu Gin Ala Lys Tyr Ser Lys Asp Ala lie Ser 
350 355 360 

cag tac tac egg ttc ate tgg aac etc ttc tat gec gaa tac ate ttc 1156 

Gin Tyr Tyr Arg Phe lie Trp Asn Leu Phe Tyr Ala Glu Tyr lie Phe 
365 370 375 

ttc ccg tta ata tag agatatggcg tttgacatcg gctacacgat cggagcacgc 1211 
Phe Pro Leu lie * 
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380 - f 

accgaagcac gaattcgttg gggcaacaga agagaaaccc tttgtggtct ataaagcgtg 1271 

agcaattttt gtacatactg tttgactggt aggggaatag agcggcgatg cgacgaggat 13 31 

cttgacgatg ctgtgggagg atccagtaga aaatgactga gttttcgtgg ttgtttctgc 13 91 

caaacaaaga ggaaaagaaa tgaaagtgaa aaggtatcgg gccttgtttt ggagggattg 14 51 

gacgtaaaaa aaaaaaaaaa aaaaca 14 77 

<210> 28 
<211> 383 
<212> PRT 

<213> Triticum asestivum 





<400> 


28 


























Met 


Ala 


Ser 


Leu 


Ala 


Ser 


Pro 


Pro 


Val 


Pro 


Ser 


His 


Ala 


Pro 


Thr 


Thr 


1 








5 










10 










15 




Ala 


Ala Arg 


Phe 


Leu 


Pro 


Ala 


Pro 


Ala 


Gly 


Arg 


Gly 


Arg 


Arg 


Pro 


Ser 








20 










25 










30 






Pro 


Pro 


Ala 


Ala 


Ser 


Pro 


He 


Phe 


Ser 


Ser 


Ala 


Ser 


Thr 


Arg 


Phe 


Thr 






35 










40 










45 






Gin 


Ser 


Pro 


Arg 


Ala 


Pro 


Cys 


Gly 


Ala 


Ala 


Arg 


Pro 


Arg 


Trp 


Arg 


Asp 




50 










55 










60 










Thr 


Val 


Arg 


Ala 


Cys 


Ser 


Gin 


Ala 


Gly 


Ala 


Ala 


Gly 


Pro 


Ala 


Pro 


Leu 


65 










70 










75 










80 


Ser 


Lys 


Thr 


Leu 


Ser 


Asp 


Leu 


Lys 


Asp 


Ser 


Cys 


Trp 


Arg 


Phe 


Leu 


Arg 










85 










90 










95 




Pro 


His 


Thr 


He 


Arg 


Gly 


Thr 


Ala 


Leu 


Gly 


Ser 


Thr 


Ala 


Leu 


Val 


Ala 








100 










105 










110 






Arg 


Ala 


Leu 


Leu 


Glu 


Asn 


Pro 


Gin 


Leu 


He 


Asp 


Trp 


Arg 


Leu 


Val 


Phe 






115 










120 










125 








Lys 


Ala 


Leu 


Tyr 


Gly 


Leu 


Val 


Ala 


Leu 


He 


Cys 


Gly 


Asn 


Gly 


Tyr 


He 




130 










135 










140 






Val 


Gly 


He 


Asn 


Gin 


He 


Tyr 


Asp 


He 


Gly 


He 


Asp 


Lys 


Val 


Asn 


Lys 


145 










150 










155 










160 


Pro 


Tyr 


Leu 


Pro 


He 


Ala 


Ala 


Gly 


Asp 


Leu 


Ser 


Val 


Gin 


Ser 


Ala 


Trp 










165 










170 










175 


Leu 


Leu 


Val 


Val 


Ala 


Phe 


Ala 


Val 


Val 


Gly 


Phe 


Ser 


He 


Val 


Val 


Ser 








180 










185 










190 






Asn 


Phe Gly 


Pro 


Phe 


He 


Thr 


Ser 


Leu 


Tyr 


Cys 


Leu 


Gly 


Leu 


Phe 


Leu 






195 










200 










205 








Gly 


Thr 


He 


Tyr 


Ser 


Val 


Pro 


Pro 


Phe 


Arg 


Leu 


Lys 


Arg 


Tyr 


Pro 


Val 




210 










215 










220 










Ala 


Ala 


Phe 


Leu 


He 


He 


Ala 


Thr 


Val 


Arg 


Gly 


Phe 


Leu 


Leu 


Asn 


Phe 


225 










230 










235 










240 


Gly 


Val 


Tyr 


Tyr 


Ala 


Thr 


Arg 


Ala 


Ala 


Leu 


Gly 


Leu 


Thr 


Phe 


Gin 


Trp 










245 










250 










255 


Ser 


Ser 


Pro 


Val 


Ala 


Phe 


He 


Thr 


Cys 


Phe 


Val 


Thr 


Val 


Phe 


Ala 


Leu 








260 










265 










270 






Val 


He 


Ala 


He 


Thr 


Lys 


Asp 


Leu 


Pro 


Asp 


Val 


Glu 


Gly 


Asp 


Arg 


Lys 






275 










280 










285 






Phe 


Gin 


He 


Ser 


Thr 


Leu 


Ala 


Thr 


Lys 


Leu 


Gly 


Val 


Arg 


Asn 


He 


Ala 




290 










295 










300 










Phe 


Leu Gly 


Ser 


Gly 


Leu 


Leu 


Leu 


Ala 


Asn 


Tyr 


Val 


Val 


Ala 


He 


Val 


305 










310 










315 










320 


Val 


Pro 


Phe 


Leu 


He 


Pro 


Gin 


Ala 


Phe 


Arg 


Ser 


Phe 


Val 


Met 


Val 


Pro 










325 










330 










335 




Phe 


His 


Ala 


Ala 


Leu 


Ala 


Val 


Ala 


Leu 


He 


Phe 


Gin 


Thr 


Trp 


Val 


Leu 








340 










345 










350 






Glu 


Gin 


Ala 


Lys 


Tyr 


Ser 


Lys 


Asp 


Ala 


He 


Ser 


Gin 


Tyr 


Tyr 


Arg 


Phe 






355 










360 










365 








He 


Trp Asn 


Leu 


Phe 


Tyr 


Ala 


Glu 


Tyr 


He 


Phe 


Phe 


Pro 


Leu 


He 






370 










375 










380 
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<210> 29 
<211> 473 
<212> DNA 

<213> Triticum aestivum 
<220> 

<221> intron 

<222> (322) . . . (426) 

<400> 29 

gcaacattgt tcatgtgttg cttctctgcc gtcatagctc tattcaagga tattcctgat 60 

gttgatggag accgagattt tggcatccaa tccttgagtg tgagattggg gccacaaaga 120 

gtgtatcagc tctgcataag catactgtta acagcctatg gggctgccac tgtagtagga 180 

gcttcatcca cacacctact tcaaaagatc atcactgtgt ctggccatgg cctgcttgct 240 
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atgcagctat tctatgcaaa gtatttcctt ataccgtttg tgcaataaaa ttt 473 
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